ABSTRACT

This thesis represents an attempt to characterize
Congressional appropriations for Defense procurement and 
research and development between 1953 and 1973. The approach 
used involves formulating alternative models of appropriations 
as a percentage of requests and deriving point and interval 
estimates for parameters using robust statistical procedures.
The results of the analysis lend support to the hypothesis 
that the response of the Congress to Defense procurement and 
research and development budget requests has changed considerably,
starting in fiscal 1969, and that the Air Force received
uniquely favorable action on its research and development 
budget requests in the 1957-1969 time period. 

The first chapter explores the institutional setting of 
Defense budgeting, from the budget prepreparation stage through
the audit stage. In Chapter II, alternative models are formulated
for appropriations as a percentage of requests, parameters
are estimated using least squares, and residuals are analyzed. 
The following two chapters explore the advantages of robust 
alternatives to least squares and include robust point and 
interval estimates for the parameters of the models of Chapter 
II. The final chapter includes (i) an examination, using Monte
Carlo techniques, of the operating characteristics of the 
statistical procedures employed in earlier chapters and (ii) a 
discussion of the results of dividing the data by groups of
years and by service.
"The budget process. . . is primarily a system
of communications, regularized and cyclical. Its
purposes fall into two logical categories: first,
the bringing of information to the proper level
for the making of decisions -- a category in
governmental policies, programs and objectives which
we may roughly classify as policy; and, second, the
providing of information both upward and downward
so that those decisions will be carried out -- a
category we may roughly classify as administrative."*


Frederick Mosher 


The topic of this thesis is the regularized and cyclical 
system of communications which forms the Department of 
Defense budget process. At various points in the budget 
cycle, decisions or budget outcomes are recorded for both 
policy and administrative reasons. Some of the data in 
these records will be analyzed in order to draw inferences 
about the process of decision-making on the defense budgets. 
Because budget data is basically quantitative data, the 
analyses that will be undertaken will use quantitative tech- 
niques to formulate and test hypotheses concerning the 
defense budget process. In addition to attempting to 




*Frederick Mosher, Program Budgeting: Theory and
Practice, Chicago, Public Administration Service, 1954, p. 5.





characterize decision-making in defense budget making, this 
thesis will try to show the potentialities of quantitative 
analysis of public budgeting systems. 

Most analyses of federal budgeting have focused on one 
or two parts of the federal budget cycle. The program budget 
literature has mainly addressed the budget preparation stage, 
while either ignoring or treating in a cursory fashion the
Congressional review stage [Hitch, 1965, Hitch and McKean, 1960, Novick, 1965].
Quantitative analysis of budget preparation has been confined 
to the formulation and testing of hypotheses based on infor- 
mation about this stage alone. The literature on Congressional 
review of budget requests has focused on interactions within 
the Congress and between Congressional Committees and agencies 
[Fenno, 1966, Wildavsky, 1964, Horn, 1970]. The budget preparation stage and
budget execution stage have not been considered in this literature,
except to a certain extent by [Wildavsky, 1964]. Quantitative
analysis of Congressional budget behavior has been dominated
by those who have tried to operationalize some of the Fenno
and Wildavsky statements concerning Committee-agency interactions
[Davis, Dempster and Wildavsky, 1966, 1967, 1971,
Johnson, 1972, Davis and Guillot, 1967].

The analysis contained in succeeding chapters focuses on 
the Congressional budget review stage of the budget process. 
More specifically, it deals with Congressional review of
Department of Defense budget requests for aircraft, missiles
and research and development. In contrast to the work of
Davis, Dempster, Wildavsky and others, this study concentrates
on just one agency rather than many, and uses recently developed
statistical techniques to analyze budget data for the years
1953-1973.

The somewhat different focus of this study is the result of
a number of developments that have taken place following the
publication of the first papers of Davis, et al. First, a
recent study by James Jernberg [Jernberg, 1972] shows that different
Congressional appropriations subcommittees have different 
interests and different approaches to budget review. Some are 
interested in budget components such as personnel and travel 
expenditures, while others are interested in outputs or program 
objectives. Instead of undertaking the ambitious task of
characterizing all Congressional budget review activity, this
analysis concentrates solely on such activity as it concerns the
Department of Defense. Second, a recent and noteworthy development
is the objection by Johnson [1967], Kanter [1972], and
others [Natchez and Bupp, 1973], to the effect that the studies by
Davis et al. have been done at too high a level of aggregation. 
The models Davis et al. propose are for total agency budget 
requests and total agency appropriations. For Defense budgets, 
Kanter and others [Stromberg,1970] have suggested conducting 
analysis at a lower level of aggregation such as Navy RDTE 
(Research, Development, Test and Evaluation). It is at this 
lower level that our study is conducted. Finally, a 
methodological development that has gathered momentum since 
the publication of the first budgeting articles by
Davis et al. is that of the theory and techniques
of robust estimation [Andrews et al., 1973]. These are
particularly well-suited to the objectives of our analysis
of defense budget data, which is based on the belief that
Congressional behavior since the early Fifties demonstrates the
characteristics of a succession of regular, stable processes mixed
with a few somewhat unusual events. The data analysis problem is
to separate the two, and to prevent the unusual events from 
contaminating or distorting the estimates of the character- 
istics of the regular process. 

The analysis to be presented omits one aspect of the 
budget process which forms a part of the Davis, Dempster, 
Wildavsky studies. Little attempt is made to structure
detailed mechanistic models of either the process of agency
budget request formulation or of Congressional review. One
reason for this omission is that in the Department of Defense, 
the process of formulation of budgets has changed consider- 
ably since the Fifties with the implementation of planning, 
programming and budgeting (PPB) and various revisions to the 
PPB system. The omission of the detailed process of budget
request formulation could conceivably result in simultaneous
equation problems [Johnston, 1963, Ch. 13 and 14]. However, for the
simple models used in this analysis, single equation methods
have been shown to be acceptable [Wold, 1953, Johnston, 1963,
pp. 377-380]. Although the theory of robust estimation has not yet
been extended to simultaneous equation estimation, we carry out
fairly extensive Monte Carlo investigations of efficiency and
mis-specification problems. Our results reflect favorably
upon the operating characteristics of the robust statistical 


estimation procedures put to use. 
The chapters which follow combine (a) an analysis of substantive
questions which arise from trying to characterize
Congressional activity in the area of defense budgeting with 
(b) a detailed consideration of methodological questions which 
arise when trying to apply various statistical estimation 
procedures to data like that on defense appropriations. 
Chapter I is an overview of the defense budget process from 
budget prepreparation to budget execution. The purpose of 
the chapter is to place in perspective the part of the process 
which is being studied in this analysis and for the first time to
gather into one package a narrative description of the processes and
institutions of defense budgeting as it has been and is
currently conducted. Chapter II reviews the Davis, Dempster,
Wildavsky work and proposes some alternative approaches and
models for Congressional appropriations. Estimates for
coefficients or parameters in these models are studied. In
Chapter III the methodological literature on robust estimation
is reviewed, and new estimates for the models are derived.
Confidence intervals for the estimates are presented in detail
in Chapter IV together with a discussion of the methods for
constructing the estimates. Chapter V employs Monte Carlo
techniques in order to gain insight into the validity of the
techniques used in earlier chapters. The results of the Monte
Carlo studies in Chapter V also point to some important
substantive conclusions about which of the models best
characterizes what will be termed the regular part of the
defense budget process, and for which years these models are
most appropriate.





CHAPTER I 


Descriptions of the federal budget process in the United 
States can focus either on a sequence of events or on the 
interactions among the major institutions in the political
system. One can first view the process as an annual sequence
of events which includes budget prepreparation, budget
preparation, budget submission and Office of Management and
Budget (OMB) review, budget submission and Congressional
review, budget execution, and audit. Each of these steps or
events, however, represents activities and possibly interactions
of three major political institutions: agency, President and
Congress. When studying the process, the analyst can either
focus on the events and attempt to examine the respective
roles of the institutions at each step, or he can focus on the
institutions and their interactions, with the budgetary
sequence of events providing the setting. The discussion which
follows is concerned with budgeting in the defense sector. It
will be organized, as depicted in Figure I.1, from the sequential
point of view, describing in succession budget prepreparation,
budget preparation, OMB review, Congressional review, budget
execution and audit. However, the reader should bear in mind
that at each of these stages all of the three major actors
or institutions have some direct or indirect impact on the 
activities which take place. 

As was stated in the introduction, the purpose of this 
chapter is to convey a picture of the defense budget process 
as a whole so that when a selected portion of the process is 
analyzed in detail in succeeding chapters, the reader will 
have some idea of how the analysis fits into a broad view of 
the process. A secondary purpose of this chapter is to 
bring together in one place current information and literature 


on defense budgeting. 


I.1. Budget Prepreparation


Program Budgeting


In the Department of Defense, budget prepreparation occurs 
during the 18-month Planning, Programming and Budgeting cycle. 
Planning, programming and budgeting is an attempt to tie 
planned resource usage to objectives or goals [Schick, p. 33]. 
In simple terms, it is an attempt to budget in terms of out- 
puts instead of inputs. For example, program budget decision- 
makers who are considering a health-care budget will be 
concerned with the budget division between health care for the 
elderly and child health care rather than the budget division 
between travel expenses (for all programs), rents, and 
maintenance. Budget review in terms of travel, rents, 
maintenance, etc. is generally considered to be budgeting in
terms of objects of expenditure [Schick, 1972, pp. 20-21, Johnson, 1972,
p. 2]. According to Allen Schick, program budgets are
associated with a planning orientation, that is, the budget 
is a forward planning device, while object budgets have a 
control orientation, with the budget used as a method of 
insuring honesty and integrity [Schick, 1972, pp. 20-23 and
pp. 30-36].

When Robert McNamara became Secretary of Defense in 1961, he
and the new Comptroller, Charles Hitch, committed the Department
of Defense (DOD) to the basic principles of program
budgeting: multiyear budgets cast in terms of outputs [Hitch,
1965, p. 27, Ruefli, 1971, pp. 166-169]. The Defense budget was divided
among nine major programs or missions:

    Strategic Retaliatory Forces
    Continental Defense
    General Purpose Forces
    Airlift/Sealift
    Reserve and National Guard
    Research and Development
    General Support
    Retired Pay
    Military Assistance

The force levels and dollars allocated to the above programs
for the next five years were called the Five Year Defense
Program. Within each program or mission were program elements
which were supposed to contribute to the objectives or output
of the program (See Table I.1). In theory, the program
elements are supposed to be substitutes for each other, such
as B-52's, land based missiles, and Polaris submarine-launched
missiles within the Strategic Retaliatory Forces program.
Other considerations involved in the decision of where to
place program elements are discussed by Smithies [Smithies, 1965,
TABLE I.1

FIVE YEAR DEFENSE PROGRAM

Outline of Program Structure
(with selected examples of program elements)

STRATEGIC FORCES
    11XXX. Offensive Forces
        11113F: B-52 Squadrons
        11213F: Minuteman Squadrons
        11221N: Fleet Ballistic Missile System
    12XXX. Defensive Forces
        12114F: F-106 Squadrons
        12214A: Sentinel System
        12427N: SPASUR
    13XXX. Civil Defense
        1311C: Shelter Survey

GENERAL PURPOSE FORCES
    21XXX. Unified Commands
        211140: PACOM
    22XXX. Forces (Army)
        22113A: Infantry Divisions
        22122A: Mechanized Brigades
        22222A: Helicopter Companies (Medium)
        22233A: Hawk Battalions
        22313A: Construction Engineering Battalions
    23XXX. Other Support (Army)
        23196A: Base Operations (Europe)
        23296A: Base Operations (Pacific)
        23613A: Redeye (Operational Systems Development)
    24XXX. Forces (Navy)
        24114N: A-6 Squadrons
        24141N: Attack Carriers
        24221N: P-3 Squadrons
        24231N: ASW Carriers
        24311N: Submarines
        24411N: Fleet Escort (Major)
        24514N: Coastal/River Patrol and Assault Forces
        24611N: Mine Countermeasures Ships
        24711N: Underway Replenishment Ships
    25XXX. Fleet Support
        25196N: Base Operations
        25613N: Phoenix Missile System (Operational Systems Devel.)
    26XXX. Fleet Marine Forces
        26122M: CH-53 Squadrons
        26211M: Divisions
        26311M: Force Troops (Combat Support)
        26496M: Base Operations
        26612M: Helicopter Avionics (Operational Systems Devel.)
    27XXX. Forces (Air Force)
        27129F: F-111 Squadrons
        27213F: RF-4 Squadrons
        27241F: Special Air Warfare Forces (SAWF)
        27311F: MACE
        27412F: Tactical Air Control System
        27596F: Base Operations
    28XXX. Other
        280110: JCS Directed and Coordinated Exercises
        28N15N: (entry illegible in source)

INTELLIGENCE AND COMMUNICATIONS
    31XXX. Intelligence and Security
        310110: Cryptologic Activities
        310130: Defense Attache System
    32XXX. National Military Command System
        32011F: National Military Command Center
    33XXX. Communications
        33111A: STARCOM
        33115K: DCA Satellite Project
    34XXX. Special Activities
        342110: National Activities
    35XXX. Activities (Other)
        35110F: Satellite Control Facility
        35112N: Oceanography
        35120A: Combat Development Activities






pp. 34-43]. The system of categories, including major
programs and program elements, is called a program structure.

Ideally, a program structure is supposed to assign activities
(program elements) to mutually exclusive programs. For
optimal allocation of resources within a program (presuming
an absence of problems like indivisibilities), it is necessary
that the marginal productivity of a resource allocated to a
program element equal the marginal productivity of that
resource with respect to every other program element within
the program. For optimal resource allocation between programs
which yield different outputs the marginal utility of a
resource allocated to one program must equal the marginal
utility of the resource with respect to every other program.
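
The two optimality conditions just stated can be written compactly. The notation here is an assumption introduced only for illustration and does not appear in the sources cited: f_i(x_i) is the output of program element i when it receives x_i of the resource, B_p is the amount of the resource given to program p, and U is the decision-maker's utility over program outputs. As in the text, indivisibilities are assumed away.

```latex
% Within one program p (elements i, j share a common output measure),
% subject to \sum_{i \in p} x_i = B_p:
\[
  \frac{\partial f_i}{\partial x_i} \;=\; \frac{\partial f_j}{\partial x_j}
  \qquad \text{for all program elements } i, j \in p .
\]
% Between programs p, q whose outputs differ, comparison is in utility terms:
\[
  \frac{\partial U}{\partial B_p} \;=\; \frac{\partial U}{\partial B_q}
  \qquad \text{for all programs } p, q .
\]
```

If either equality failed, moving a unit of the resource from the element (or program) with the lower marginal return to the one with the higher would raise output (or utility) at no extra cost, which is why equality is necessary at an optimum.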


Program Budget Cycle 


Currently, the program budget cycle is an 18-month cycle 
conducted in three phases and resulting in the President's 
annual budget submission to the Congress in January. The 
first phase involves military threat and requirement evalua- 
tion. In the second phase, multiyear programs to meet the 
threat are derived. These programs and the dollars allocated 
to them are listed in the Five Year Defense Program. The 
third phase of the cycle is the development of the annual
(next year's) budget.*


*The rationale behind the three phases is discussed by
Hitch in [Hitch, 1965, pp. 28-39].





Since its inception, each phase of the program budget 
cycle has included extensive discussion and negotiation 
between the Office of the Secretary of Defense (OSD) and the 
military services. The general approach in each phase has 
been the issuance of some kind of guidance or constraints by 
OSD, such as fiscal or strategy guidance; next the services
respond with proposals; OSD approves, disapproves or modifies 
the proposals; finally, some kind of agreement is reached 
before the next phase is entered [Kanter and Anger,1973]. In the 
McNamara era (1961-1969), OSD appeared to be controlling the 
process [Enke, 1963, Crecine, 1971]. Under Melvin Laird and David
Packard the role of the military services has apparently
been emphasized and broadened* [Kanter and Anger, 1973, pp. 6-10].

*It is still difficult to assess the differences between
PPB under Laird and PPB under Secretary James Schlesinger.

In order to pass from the second phase of the program
budget cycle, programming, into the third phase, budgeting,
or in terms of this analysis to pass from budget prepreparation
to budget preparation, it is necessary to translate the
program budget into a different system of accounts. The
Congress does not appropriate funds specifically for the
program elements in the Five Year Defense Program. Rather,
it considers the inputs or resources which produce the
program element force levels. The budget submitted to the
Congress is an input-oriented budget (although it is not
called an object budget). The general categories or line
items are:*

    Military Personnel (MPN)
    Operations and Maintenance (OMN)
    Procurement (PAMN, OPN)
    Research and Development (RDTE)
    Military Construction (MILCON)

These categories are further subdivided into budget activities
such as Permanent Change of Station Travel (Navy).

Since the budget enacted by the Congress is in terms of
dollar allocations for resources or inputs and the program
budget is in terms of dollar allocations for outputs, it is
necessary each year to translate or crosswalk the program budget
into a resource budget.


Crosswalking 


Crosswalking is a term applied to the transformation of 
the program budget into a line item resource budget, or vice
versa. Translation from a program budget into a line item
budget should be rather straightforward, as the following
discussion will demonstrate.

*The actual budget submission to the Congress contains
estimates for Army, Navy, Air Force, Marine Corps and Defense
Agencies in most of these accounts and the Procurement
account is further divided among aircraft, missiles, ships,
torpedoes, combat tracked vehicles and other procurement.

A program element within the Navy Tactical Air Program
or Mission is program element 2-41-22-N, F-14 squadrons.
The approved budgets (hypothetical) for F-14 squadrons for
FY 72 through FY 76, as contained in the FYDP, are (in
millions of dollars): 100, 150, 444.2, 398.027, 240.172.
The following table shows how these dollars are used in terms 


of the line item categories: 





            FY72     FY73     FY74      FY75      FY76
MPN                                     .182      4.290
OMN                                     6.326     14.079
PAMN*                         413.5     310.00    217.69
OPN*                                    (entries illegible in source)
RDTE        100      150      30.7      150.072
Total:      100      150      444.2     467.987   240.172


*PAMN stands for Procurement Air and Missile Systems Navy,
OPN stands for Other Procurement Navy,
RDTE stands for Research, Development, Test and Evaluation,
OMN stands for Operations and Maintenance, Navy,
MPN stands for Military Personnel, Navy.

**The above numbers are hypothetical numbers only.


TABLE I.1a


These hypothetical budgets will supposedly yield eleven 
squadrons in FY75 and forty-four in FY76. The information 
contained in the table is called Program Element Summary Data. 
To obtain the Navy line item budget in RDTE for FY72 it is
only necessary to sum over all the program elements the RDTE
dollars they require. The same is true for the other line 
item categories. 
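
The summation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the F-14 entries reuse the hypothetical FY74 figures from the table (413.5 in PAMN and 30.7 in RDTE, in millions of dollars), while the second program element, its name, and its dollar amounts are invented here so that there is something to sum over.

```python
from collections import defaultdict

# Hypothetical Program Element Summary Data for a single fiscal year,
# in millions of dollars. The F-14 figures are the hypothetical FY74
# entries from the text; "Hypothetical element B" is invented for
# illustration and does not come from the source.
PROGRAM_ELEMENTS = {
    "F-14 squadrons": {"PAMN": 413.5, "RDTE": 30.7},
    "Hypothetical element B": {"MPN": 12.0, "OMN": 20.5, "RDTE": 5.3},
}


def crosswalk_to_line_items(program_elements):
    """Sum each line item's dollars over all program elements."""
    totals = defaultdict(float)
    for dollars_by_line_item in program_elements.values():
        for line_item, dollars in dollars_by_line_item.items():
            totals[line_item] += dollars
    return dict(totals)


line_items = crosswalk_to_line_items(PROGRAM_ELEMENTS)
for name in sorted(line_items):
    print(f"{name}: {line_items[name]:.3f}")
```

The crosswalk in this direction loses no dollars: the line item totals add up to the same figure as the program element totals, which is why it is the easy direction.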

Crosswalking from line item categories to program categories
is not quite as straightforward, since it requires
allocation of an overall budget figure for an input such as
Military Personnel or RDTE among all program elements. The
problems involved in crosswalking from line item categories
to program categories have been addressed in [Crecine and
Fischer, 1971, Ruefli, 1971].

If we aggregate dollars for program elements, such as the
F-14, into dollars for the major programs of which these
elements form a part, then a crosswalked budget for the
defense department might look something like Table I.2.


I.2. Budget Preparation


The Process 


Phase III of the program budgeting cycle involves the
preparation of the annual budget. As discussed in the
previous section, the budget which will be sent to the Congress
is in terms of line items rather than programs. Budget estimates
are submitted by the services to the Secretary of Defense
on or about October 1 of each year. These estimates are then
sent to the Assistant Secretary of Defense (Comptroller), 
ASD(C), who in turn divides the budget review among director- 
ates for military personnel, operations and maintenance, 
procurement, military construction, research and development 
[Crecine and Fischer, 1971, pp. 46-50]. The Department of Defense
holds its own budget hearings during the budget preparation
stage [Navy Programming Manual, pp. IV-5]. As will be
pointed out later, since 1961 OMB (or until 1970 the Bureau
of the Budget) review of defense budgets has consisted of
participation in these hearings, but its influence has been
reduced [Halperin, 1972, p. 317].

In theory, budget preparation should be a relatively
simple process which merely requires crosswalking the costs
of approved programs for the next year into line item categories.
In reality, the budget preparation stage has not
proved to be that simple. Especially during the McNamara
era, many decisions were made during this stage [Crecine and 


Fischer,1971]. 


Budget Preparation Under McNamara 


The discussion in this section emphasizes the fact that 
budget preparation in the McNamara era (FY1962-FY1970) was
not merely a straightforward crosswalking of approved programs
into line items. It is largely based on the analysis of John
Crecine and Gregory Fischer in their paper "On the Resource
Allocation Process in the Department of Defense" [Crecine and Fischer, 1971].
McNamara program budgeting was based on a requirements
approach. The idea was that requirements were to be set and
then programs developed which satisfied the requirements at
the minimum cost. As Secretary McNamara himself said:


"The President's charge to me was a two-pronged
one -- to determine what forces were required and
to procure and support them as economically as
possible." [Tucker, 1966, p. 14]


"We start with the political objective, the
formulation of which is presented to us by the
Secretary of State and upon which the President
indicates his desires that we develop a military
program that will support the political objective.
As you know, the President has stated the defense
budget is to be established without regard to
arbitrary budget ceilings. We determine the
force levels which we believe are necessary to
support the political objective and then act to
fulfill the President's second direction to
us. He has indicated that we are to attain
the specific force levels necessary to support
the political objective at the lowest cost."
[Tucker, 1966, p. 27]

Because of this philosophy, the services were never given 
explicit fiscal or budgetary constraints in the McNamara era. 
However, regardless of budget philosophy, the defense budget 
plan must satisfy the following identity, called the "Great
Identity" by Crecine and Fischer:


Defense Expenditures + Non-Defense Expenditures = Tax Revenues + Deficit


If none of the other three terms are controlled by the
Secretary of Defense, then, like it or not, the Defense
Department must operate under some kind of budget constraint.
The research of Crecine and Fischer is largely based on this
observation.

As Crecine and Fischer point out, tax revenues (estimated)
depend on the state of the economy, as estimated by the
Treasury Department and the Council of Economic Advisors, and 
on the tax laws, changes to which must be approved by the 
Congress in general and the House Ways and Means Committee 
in particular. Non-defense expenditures are, of course, 
proposed by the agencies involved and reviewed by OMB. 
Another factor related to non-defense expenditures is their
uncontrollability. Many government expenditures, such as
Social Security and welfare payments, are made according to
fixed formulas and require basic changes in legislation if
they are to be changed. Others, like interest on the national
debt, must be paid, regardless of what else is in the budget.
According to the Tax Foundation, Inc., in 1969 over $118
billion in outlays were relatively uncontrollable. Of this
total, $90 billion were outlays in non-defense areas [Tax Foundation,
1973, p. 10]. The other term in the equation is the
deficit. The President, together with the Council of Economic 
Advisors, OMB and the Treasury Department yearly set a deficit 
target which they believe will achieve the appropriate balance 
between government expenditure requirements and stability of 
the economy — low unemployment and low inflation. Violation 
of this target will generally be viewed as involving economic 
penalties. 

During the McNamara era, the approved set of programs in 
the FYDP always implied expenditures higher than would be 
feasible under the "Great Identity" [Crecine and Fischer, 1971, p. 37,
Anger, 1973, p. 6]. Consequently, after receiving the service
budget requests, the directorates in the office of ASD(C) would 
submit planning estimates to McNamara which stated how much 
they believed could be cut from the budget requests. By this 
time (mid-October), McNamara had a reasonably good idea what 
the other terms of the Great Identity were going to be and 
could tell the directorates either to cut more or less than 
the planning estimates. In the meantime, he negotiated for 


deficit increases and for non-defense expenditure decreases. 
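
The arithmetic behind this bargaining can be sketched by solving the Great Identity for the defense term. The function names and every dollar figure below are assumptions made up for illustration (in billions of dollars); they are not taken from Crecine and Fischer or from any actual budget.

```python
def implied_defense_ceiling(tax_revenues, deficit_target, non_defense):
    """Solve Defense + Non-Defense = Tax Revenues + Deficit for Defense."""
    return tax_revenues + deficit_target - non_defense


def required_cut(service_requests, ceiling):
    """Dollars that must be cut from service requests to fit the ceiling."""
    return max(0.0, service_requests - ceiling)


# Hypothetical figures, in billions of dollars.
ceiling = implied_defense_ceiling(
    tax_revenues=190.0, deficit_target=5.0, non_defense=115.0
)
cut = required_cut(service_requests=100.0, ceiling=ceiling)
percent_cut = 100.0 * cut / 100.0

# A larger deficit target or smaller non-defense outlays relaxes the ceiling.
relaxed_ceiling = implied_defense_ceiling(
    tax_revenues=190.0, deficit_target=8.0, non_defense=112.0
)
print(ceiling, cut, percent_cut, relaxed_ceiling)
```

In this reading, negotiating for a larger deficit or smaller non-defense expenditures is simply a way of raising the implied ceiling, and so of reducing the cut the directorates must find.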
One wonders how programs requiring resources in more than
one category were pieced back together after the directorates
completed their review. According to Crecine and Fischer,
analysts from the following offices participated in putting
the programs back together in November and December: Assistant
Secretary of Defense for Systems Analysis, Assistant Secretary
of Defense for Installations and Logistics, Assistant Secretary
of Defense, Comptroller [Crecine and Fischer, 1971, p. 56]. Figure
I.2 is a copy of Crecine and Fischer's flow diagram of this
process.

The record of budget cuts at the Secretary of Defense 
level in the McNamara era is consistent with the view that 
significant changes were made in the budget at this point in
the process. The following table gives the percentage 
reduction in service budget estimates prior to submission to
Congress for 1962-1970:


Year    Percentage reduction
1962    (illegible in source)
1963    9.78
1964    (illegible in source)
1965    28.09
1966    (illegible in source)
1967    (illegible in source)
1968    22.91
1969    24.09
1970    24.69

TABLE I.3

Annual O.S.D. Reductions in Service Budget Estimates
Source: Office of Assistant Secretary of Defense
(Comptroller).
FIGURE I.2. MODEL OF OSD (COMPTROLLER'S) BUDGET REVIEW





Crecine and Fischer suspected that because the approved 
program (FYDP) would not necessarily be included in the 
budget, and because the services had been submitting budgets 
in line item categories for years, simple linear models might 
explain or predict service budget requests in each of the 
line item categories. These models included explanatory vari- 
ables like the previous year's appropriation, the difference 
in the last two years' appropriations, the difference between
previous year's request and appropriation, and dummy variables 
for administration in power and whether or not the nation was 
at war. In most cases the models explained well over 95% of 
the variance in service budget requests from 1969 back to the 
Forties. Although analysis of this kind should be applied 
and used with care,* their results appear to indicate that
regardless of budget philosophy at the Secretary of Defense 
level, the service budget requests have a remarkable stability, 
being related to some rather straightforward and uncomplicated 
variables. Crecine and Fischer also formulated simple linear 
models for directorate cuts. The results were similar to
those for service requests. 
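
To make the idea of a simple linear model concrete, the sketch below fits a service request to a single explanatory variable, the previous year's appropriation, by ordinary least squares and reports the share of variance explained. This is a deliberately reduced illustration and not Crecine and Fischer's specification: their models used several explanatory variables, whereas this one uses one, and the data series, variable names, and function name are all invented here.

```python
def fit_simple_ols(x, y):
    """Least squares fit of y = intercept + slope * x; returns R-squared too."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # Share of the variance in y accounted for by the fitted line.
    ss_res = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    r_squared = 1.0 - ss_res / ss_tot
    return intercept, slope, r_squared


# Invented series, in billions of dollars: each request is paired with
# the appropriation received the year before.
prev_appropriation = [10.0, 11.0, 12.5, 13.0, 15.0, 16.5]
request = [11.2, 12.1, 13.9, 14.2, 16.8, 18.0]

intercept, slope, r_squared = fit_simple_ols(prev_appropriation, request)
print(f"request = {intercept:.3f} + {slope:.3f} * previous appropriation")
print(f"R-squared = {r_squared:.4f}")
```

A high R-squared in a fit of this kind is what "explained well over 95% of the variance" refers to; it says that requests track last year's appropriation closely, not why they do.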

In summary, the total analysis of Crecine and Fischer
leads us to suspect that budget preparation in the McNamara 
era was something other than a simple crosswalking of approved 


programs into line items. 


*Methodological problems with regression analysis of this 
type for Congressional appropriations will be discussed in 
Chapter II. 





Budget Preparation Under Laird 


When Melvin Laird became Secretary of Defense, he and his 
Deputy Secretary, David Packard, retained a three phase 
program budget cycle. However, they made some basic changes 
in emphasis. The Laird system has largely moved away from an
approach which focuses on requirements. A five year budget
constraint is set early in the programming phase, the second
phase of the cycle [Kanter, 1973, p. 8]. In addition, the military
services were given more of a say in the allocation of
resources to programs [Anger, 1973, p. 10]. Laird and his successors
have communicated the 5-year budget constraint, by year,
to the services via either a document called the Fiscal
Guidance Memorandum (issued in the Spring of 1970) or the Defense
Planning and Programming Guidance Memorandum (issued in the
late winter or early spring of 1971, 1972, and 1973). The
first budget submission which reflected this new system was
the FY 1972 budget. Table I.3 on the next page gives an
example of the budget or fiscal constraint given to the
services. Note that the dollar constraints are given by
program or mission, such as Land Forces or Tactical Air Forces;
the service then puts together a proposal specifying how its
Land Forces dollars will be divided among the program elements
under Land Forces. The set of service proposals is called the
Program Objectives Memorandum (POM). The OSD responses or
decisions on issues raised in the POMs are called the Program
Decision Memoranda (PDM).





TABLE I.3 
FISCAL GUIDANCE 

Fiscal Guidance categories are divided into three groups. Group 
I, called "Major Mission," is made up of the FYDP program elements 
containing DoD combat forces and certain special support activities 
such as Research and Development (FYDP Program 6), Communications, 
Intelligence, and Support to Other Nations. Group II, called "General 
Support," contains elements from all Defense programs which provide 
mission and central support to Group I programs. Group III contains 
miscellaneous costs which are a function of management policies not 
directly related to the forces or their support. 


* * * * * 

Fiscal Guidance Categories and Services Concerned 

Categories:                                Program Elements in: 
                                           Army  Navy  Marine Corps  Air Force 

I.   Major Mission 
     A. Force Mission 
        1. Strategic Forces: 
           a. Offensive                    [illegible] 
           b. Defensive                    X 
           c. Control & Surveillance       [illegible] X 
        2. Land Forces                     X X 
        3. Tactical Air Forces             X X 
        4. Naval Forces: 
           a. ASW & Fleet Air Defense      X 
           b. Amphibious                   X 
           c. Support                      [illegible] 
        5. Mobility Forces                 X X [illegible] 
     B. Other Mission 
        1. Intelligence & Security         X X 
        2. Communications                  X X 
        3. R&D (program 6)                 [illegible] X X 
        4. Support to Other Nations: 
           a&b. Military Assist.           X [illegible] 
           c. Allies W.R. Stocks           X X 
II.  General Support 
     A. Base & Individual Support          X [illegible] X 
     B. Training                           X X X 
     C. Command                            X X X 
     D. Logistics                          X X [illegible] 
III. Miscellaneous Costs (retired 
     pay, family housing, military 
     construction)                         [illegible] X X [illegible] 
Miscellaneous costs are defined by particular Resource Identification Codes (RICs) 
drawn from certain program elements. The TOA identified by these RICs are deleted 
from the Program Elements in which originally appearing and recategorized by RIC. 

Under the Laird system, by the time the services submit 
their budgets in October, they have been specifying programs 
subject to a budget constraint for over seven months. 
Consequently, one might suspect that OSD cuts of the service 
budget requests, following the October submission, would be 
minimal. This turns out to be the case. In FY1972, the cut 
was 0.13 per cent and in FY1973 it was 2.06 per cent. By 
comparison, the average cut in the FY1962-FY1971 time period 
was 18.8% with the lowest figure being 8% in FY1962. These 
figures show that under the Laird system, major budget 
reductions have not been made during the budget preparation 
stage. Also, major program decisions have not been delayed 
until November and December; rather, most decisions have been 
communicated to the services by August. These two factors, 
the small budget cuts and the issuance of PDM's in August, 
appear to indicate that budget preparation under Laird has 
for the most part become the crosswalking of the budget from 
program categories into line item accounts discussed earlier. 

Having completed the discussion of the program budget 
cycle, and the budget prepreparation and preparation stages, 
it might be useful to summarize the chronological sequence of 
events. The 18 month program budget cycle for FY1974 began 
in June, 1971 with threat analysis and long range planning. 
The programming phase began in February of 1972 and continued 
through the spring and summer of that year. On October 1, 
the Services submitted their budget estimates to OSD. This 
budgeting phase (i.e., budget preparation) continued through 
December of 1972. Finally, in January of 1973 the President 
transmitted his FY1974 budget estimate to the Congress. 
Figure I.3 depicts this sequence of events in general and 
Figure I.4 depicts the events in detail. 





June 1971            February 1972     August 1972       October 1972      January 1973 

Threat and      -->  Programming  -->  Budget       -->  OSD and      -->  Presidential 
Policy Analysis                        Preparation       OMB Review        Submission 

FIGURE I.3 


I.3. OMB Review 


The Budget and Accounting Act of 1921 created the Bureau 
of the Budget (later re-named the Office of Management and 
Budget by President Richard Nixon) and made the executive 
budget part of the federal budgetary process. The Act 
provided for Presidential review of the budget and Presidential 
submission of the budget proposal to the Congress. One of the 
functions of the Bureau of the Budget was to assist the 
President in his annual review of agency budget requests. The 
Bureau's role as a mechanism for Presidential control became 
more clear when Franklin Roosevelt moved it from the Treasury 
Department into the Executive Office of the President in 1939 


[Redford et al., 1965, p. 325]. 
FIGURE I.4  PROGRAM/BUDGET REVIEW SCHEDULE 
CALENDAR YEAR 1972 

ITEM  ACTION                                                              AGENCY  ACTION DATE 
   1  Submit JSOP-Vol I (74-81) Strategy and Force Planning               J       Jun 23, 1971 
   2  Provide comments on JSOP-Vol I to JCS                               D       Aug 4, 1971 
   3  Issue Defense Policy and Planning Guidance                          D       Oct 23, 1971 
   4  Identify and issue Selected Analysis Topics                         D       Nov 1, 1971 
   5  Issue Materiel Support Planning Guidance                            D       Dec 7, 1971 
   6  Submit comments on Defense Policy and Planning Guidance             JC      Dec 7, 1971 
   7  Submit JSOP-Vol II (FY 74-81) Analysis and Force Tabulations        J       Dec 23, 1971 
   8  Submit comments on Materiel Support Planning Guidance               JC      Dec 30, 1971 
   9  Update Five Year Defense Program through FY 1973                    C       Jan 10, 1972 
  10  Update Five Year Defense Program for FY 1974-1977                   C       Jan 21, 1972 
  11  Issue PDM [illegible]                                               D       Jan 31, 1972 
  12  Issue Planning and Programming Guidance Memorandum                  D       Feb 22, 1972 
        Note: Includes (1) Force Planning (2) Fiscal Levels (3) SEA 
        Assumptions and (4) Materiel Support Planning Guidance 
  13  Submit Joint Research and Development Objectives Document (JRDOD)   J       Feb 22, 1972 
  14  Provide selected analysis                                           JC      Prior to Mar 31, 1972 
  15  Submit Joint Force Memorandum (JFM)                                 J       May 8, 1972 
  16  Submit Program Objectives Memorandum (POM)                          C       May 22, 1972 
  17  Submit JSOP-Vol I (75-82) Strategy and Force Planning               J       Jun 1, 1972 
        (CY 1973 cycle) 
  18  Issue initial Budget Guidance for preparation of FY 1974            D       Jun 15, 1972 
        budget estimates 
  19  Issue first "Issue Paper" (IP)                                      D       Jun 19, 1972 
  20  Transmit first "Issue Paper" (IP) to SecDef                         D       Jun 23, 1972 
  21  Issue last "Issue Paper" (IP)                                       D       Jul 17, 1972 
  22  Transmit last "Issue Paper" (IP) to SecDef                          D       Jul 21, 1972 
  23  Issue Program Decision Memorandum (PDM)                             D       Aug 4, 1972 
  24  Submit reclamas to PDMs                                             C       Aug 14, 1972 
  25  Issue reclama decisions on PDMs                                     D       Aug 24, 1972 
  26  Issue Defense Policy and Planning Guidance (CY 1973 cycle)          D       Sep 1, 1972 
  27  Identify and issue Selected Analysis Topics (CY 1973 cycle)         D       Sep 1, 1972 
  28  Issue Materiel Support Planning Guidance (CY 1973 cycle)            D       Sep 5, 1972 
  29  Submit annual Budget Estimates and backup information               C       Oct 2, 1972 
  30  Start Budget Hearings                                               D       Oct 9, 1972 
  31  Submit comments on Defense Policy and Planning Guidance             JC      Oct 13, 1972 
  32  Update Five Year Defense Program                                    C       Oct 13, 1972 
  33  Submit comments on Materiel Support Planning Guidance               JC      Oct 20, 1972 
  34  Start issue of Program/Budget Decisions (PBDs)                      D       Nov 6, 1972 
  35  Provide comments (reclamas) on PBDs                                 C       Nov 13, 1972 
  36  Issue revised PBDs based on reclama comments                        D       Dec 1, 1972 
                                                                                  Dec [illegible], 1972 
  37  Conduct joint meetings with JCS and Service Secretaries to          D       Dec 15, 1972 
        discuss major unresolved budget issues 
  38  Submit JSOP-Vol II (FY 75-81) Analysis and Force Tabulations        J       Dec 22, 1972 
        (CY 1973 cycle) 

LEGEND: D  = SecDef 
        J  = JCS 
        C  = Military Departments and Defense Agencies 
        JC = JCS, Military Departments, Defense Agencies 
Between the end of the Second World War and the advent 
of the Kennedy Administration, defense budget requests were 
treated much like other agency requests by the Bureau of the 
Budget [Korb,1969, Ch. 3]; that is, BoB conducted an independent 
review of the budget and recommended adjustments (usually 
cuts) to the President prior to Presidential submission to the 
Congress. The mechanics of this review are discussed in [Ott 
and Ott, 1969,Ch.2]. Table I.4 contains BoB cuts from 1949 
through 1960. 


Fiscal         Army                  Navy                Air Force 
Year     Amount  Percentage    Amount  Percentage    Amount  Percentage 
a0 5.82 Df Del 49 4.11 47 
iLsiowl O.17 : Oeaa D 0.80 a 
1954 0.90 9 ee ies Se (le. 2. 
1955 Pit af ISS ut ile oe 10 
1956 1.30 14 1.28 ii Sy 8 
BD | 0.44 5 IL lie 15 2.26 is 
1958 2.14 20 0.96 8 2.98 di5 
e586. «0.58 6 0.06 i the Se ff 
1960 2.47 ee 2.46 19 6 10 
1961 0.73 i O25 2 2.64 14 
(1950- 

1961) 16.52 b5e0 eal 


(1.65) (16.1) (1.50) (13.7) er) (16.6) 


TABLE I.4 


Since the beginning of the McNamara years during the 
Kennedy Administration, BoB and OMB have not conducted an 
independent review of defense budget estimates. 





"It should be noted that the budget of the 
Department of Defense is handled somewhat 
differently from those of other agencies. 
The Bureau of the Budget participates with 
the financial officers of the Defense Depart- 
ment in a review of the requests of the 
various services for budgetary allowances, 
but its role here is not quite the same as 
with other agencies. It acts more as an 
advisor to the Secretary of Defense than as 
an arbiter." [Ott and Ott, 1969, p. 23]. 


The current situation is discussed in the Navy Programming 
Manual, 

"The analysts of OSD and OMB normally make a 
joint review of the budgets submitted by the 
military departments. However, OMB analysts 
have authority to submit separate decisions 
on the markups." [p. IV-5] 

These "separate decisions" mentioned in the above paragraph 
apparently are not really decisions, since according to 
Morton Halperin, 

"Under Eisenhower, Kennedy and Johnson it 
became a matter of tradition that the Budget 
Director would have to appeal Secretarial 
decisions on the Defense budget to the 
President, the reverse of the situation in 
all other departments." [Halperin,1972,p. 317] 

Other literature conflicts with Halperin's view of the power 
of the Secretary of Defense's decisions under Eisenhower but 
is in agreement with his view of the Kennedy and Johnson 
years [Korb,1969, Ch. 3]. Currently, the Director of OMB sits 
on the Defense Program Review Committee, a committee of the 
National Security Council "whose purpose is to keep the 
annual defense budget in line with foreign policy objectives" 
[Leacacos,1971,p.7]. The influence of this committee is more 
important in the fiscal guidance stage of the PPB cycle than 
after budget preparation. 
In summary, OMB review of defense budget estimates 
currently appears to be limited. The Director may play a 
part in setting the overall budget ceiling for the Department 
of Defense through his membership on the Defense Program 
Review Committee. However, this is quite different from the 
role which OMB plays in the budget review of non-defense 
agencies. 


I.4. Congressional Review of Budget Estimates 


Introduction 


In most Western democratic countries, the power of the 
purse is held by the legislature. (One exception is West 
Germany) [Macridis and Ward,1963,p.352]. The Congressional 
appropriations process in the United States is probably more 
complex than the legislative appropriations process in any 
other country in the world [Macridis and Ward,1963]. The process 
involves numerous interactions in committees of both houses, 
floor debate in each house, and negotiation between the 
houses. Also, for most agencies, Congressional scrutiny 
occurs not only when appropriations are being considered, but 
also when authorizations are being examined. The purpose of 
this section is to discuss the basics of Congressional 
consideration of Defense budget requests. 

Before discussing Congressional budget review in detail, 
the importance of the Congress in the overall defense budget 
process should probably be discussed briefly. Research on 
Congressional appropriations for defense has been sparse. Recent 
notable exceptions are [Korb,1973, Kanter,1972, Goss,1970]. The 
classic studies of Fenno and of Wildavsky have explicitly 
been confined to non-defense areas as were the quantitative 
models of Davis, Dempster and Wildavsky. One of the reasons 
for this paucity of research might be the record of Congressional 
changes in defense budget requests between FY1954 and 
FY1968. The following table gives the percentage and 
magnitude of changes, cuts or increases, made by the Congress 
in that period. 


Fiscal Year        %         Millions of $ 
1954             -3.9           -1089 
1955             -3.6           -1063 
1956             -1.1        [illegible] 
1957             +1.5           + 493 
1958          [illegible]       - 639 
1959          [illegible]    [illegible] 
1960             -0.1        [illegible] 
1961             +1.7           + 660 
1962             +0.58          + 268 
1963             +0.48          + 230 
1964             -3.66          -1797 
1965          [illegible]       - 717 
1966          [illegible]    [illegible] 
1967             +0.70          + 403 
1968             -2.30          -1630 

TABLE I.5 


Source: Office of Assistant Secretary 
of Defense (Comptroller) 


By comparison, in the 1962-1968 time frame, the Secretary of 
Defense cut service budget requests by an average of 18.6% with 
the lowest cut being 8% in 1962. Note from the previous 
table that Congressional cuts of Presidential budget requests 
in 1962-1968 never exceeded 3.66%. 

One of the reasons for the increasing amount of research 
on Congress and defense appropriations is the record of 
Congressional cuts since FY 1969. 





Fiscal Year        %         Millions of $ 
1969             -6.75          -5201 
1970          -7.[illegible]    -5638 
1971             -3.13          -2147 
1972          [illegible]       -2951 
1973             -6.56       [illegible] 

TABLE I.6 

Source: Office of Assistant Secretary of Defense 
(Comptroller) 


Richard Fenno in his analysis of non-defense appropriations has 
argued that Congressional changes of less than 5 percent are marginal 
or insignificant [Fenno,1966,p.353]. Since 1969, changes to the 
defense budget request were greater than 5 percent three 
times and in the other two years they were three and four 
percent respectively. It should also be noted at this point 
that in the 1962-1973 time frame, Congressional changes to 
procurement and RDTE requests were higher than the figures in 
Tables I.5 and I.6 [Korb,1973,p.16]. 
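
The comparison with Fenno's 5 percent threshold can be made concrete 
with a short computation. The sketch below is illustrative only: it uses 
three legible rows of Tables I.5 and I.6, the function names are 
hypothetical, and the "implied request" is simply the dollar change 
divided by the percentage change, a derived figure rather than one 
reported by the Comptroller. 

```python
# Illustrative sketch. Data are three rows taken from Tables I.5 and I.6:
# fiscal year -> (percent change, change in millions of dollars).
changes = {
    1964: (-3.66, -1797),
    1969: (-6.75, -5201),
    1971: (-3.13, -2147),
}

# Fenno's cutoff (percent) below which a Congressional change is "marginal".
FENNO_THRESHOLD = 5.0


def implied_request(pct_change, dollar_change):
    """Back out the request (millions of $) implied by a change.

    Assumes the percentage is the dollar change relative to the request.
    """
    return dollar_change / (pct_change / 100.0)


def is_marginal(pct_change, threshold=FENNO_THRESHOLD):
    """Return True when the change is smaller in magnitude than the threshold."""
    return abs(pct_change) < threshold


for year, (pct, dollars) in sorted(changes.items()):
    request = implied_request(pct, dollars)
    label = "marginal" if is_marginal(pct) else "non-marginal"
    print(f"FY{year}: implied request ${request:,.0f}M, change {pct:+.2f}% ({label})")
```

Under these assumptions the FY1964 and FY1971 changes fall below the 
threshold while the FY1969 change does not, which is the contrast the 
two tables are meant to bring out. 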


Important Terms 


Three important terms that arise in any discussion of 
budgeting are authorization, appropriation, and outlay or 
expenditure. An authorization by the Congress gives approval 
for functions or activities of an agency. In other words, the 
Congress passes legislation authorizing or approving activities 
such as defense research and development. For many agencies, 
the authorizing legislation specifies a maximum amount that can 
be appropriated [Ott and Ott,1972,pp.51-52], and in essence 
says the agency can now seek appropriations for its approved 
programs. Theoretically, substantive issues such as busing 
or anti-busing provisions, program cancellations, and so 
forth should be settled in the authorization bill since the 
appropriations bill is supposed to "appropriate not legislate."* 
In reality, the line between appropriation and legislation is 
somewhat unclear, and consequently substantive issues are 
often addressed in appropriations bills [Harris,1964,p.87]. 

*Authorizations can in effect appropriate if they provide 
funds through "backdoor financing" [Fenno,1966,p. 114]. However, 
this is definitely the exception rather than the rule and is 
almost unheard of in defense areas. 

An appropriation is the authority to obligate or commit 
the government to certain expenditures. Appropriations are 
generally defined as new obligational authority (NOA). 
Obligational authority may be granted for one, two or some 
specified number of years, that is, the agency may have one, 
two or some specified number of years to obligate the 
appropriations. In some cases, an agency may have no year accounts, 
which are available for obligation until the purpose of the 
spending is accomplished. An agency's total obligational 
authority (TOA) includes not only NOA but also unobligated 
balances from prior years' appropriations. In the Department 
of Defense the following time limits are in effect for the 
six major accounts: 


MP                 1 year 
OM                 1 year 
Procurement        3 years 
  (except SCN) 
SCN**              5 years 
RDTE               2 years 
MILCON             2 years 

**SCN refers to Shipbuilding 
and Conversion, Navy 

TABLE I.7 


Expenditures or outlays are defined as the payment of 
liabilities incurred by the government, or the actual cash 
flow.* Expenditures in a particular year for the Defense 
Department may be different from appropriations. For example, 
NOA in 1970 was $69.4 billion while outlays were $76.3 billion 
[Korb,1973,p.27]. The reason for this is that an obligation 
might be incurred when a contract is signed, but payment may not 
be completed until the item is delivered. Agencies generally 
have two years after the expiration of obligational authority 
to complete the financial transactions associated with that 
authority [31 U.S. Code, 701-706]. If an agency has obligated 
funds but not expended them, these funds are referred to as 
unspent obligations. The term unexpended balances refers to 
the sum of unobligated balances and unspent obligations. 

*The Navy Programming Manual defines outlays as "the 
amount of funds that must be drawn from the Treasury for goods 
and services received within the fiscal year under review." 
(p. L-1) 

In 1971, unexpended balances for the Department of Defense 
totalled $35.2 billion of which $11.5 billion was unobligated 
balances [House Report 92-1389]. Figure I.5 summarizes these 
relationships. 


TOA = NOA + Unobligated balances 

Unexpended Balances = Unobligated balances + Unspent obligations 

FIGURE I.5 
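
The two identities in Figure I.5 can be checked against the 1971 
figures quoted above. The following is a minimal sketch, not part of the 
original analysis; the function and variable names are hypothetical, 
amounts are in billions of dollars, and the unspent obligations figure 
is derived here by subtraction rather than taken from the House report. 

```python
def total_obligational_authority(noa, unobligated):
    """TOA = NOA + unobligated balances from prior years."""
    return noa + unobligated


def unexpended_balances(unobligated, unspent):
    """Unexpended balances = unobligated balances + unspent obligations."""
    return unobligated + unspent


def unspent_obligations(unexpended, unobligated):
    """Rearranged identity: funds obligated but not yet paid out."""
    return unexpended - unobligated


# 1971 Department of Defense figures cited in the text (billions of $).
unexpended_1971 = 35.2
unobligated_1971 = 11.5

# Derived, not reported in the source: obligated-but-unspent funds.
unspent_1971 = unspent_obligations(unexpended_1971, unobligated_1971)
print(f"Unspent obligations, 1971: ${unspent_1971:.1f} billion")
```

On these figures roughly two thirds of the 1971 unexpended balance 
had already been obligated, which is why the unobligated portion alone 
is the quantity of interest in Congressional review. 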

Although Congressional appropriations are defined as NOA, 
unobligated and unexpended balances can have indirect effects 
on Congressional budget review. For example, if an agency has 
unobligated balances for which the obligational authority is 
about to expire, the Congress may extend the obligational 
authority for one or more years but cut the NOA request by 
an amount equal to the amount of the extended obligational 
authority [House Report 92-666, p. 84]. Every year, the House 
Appropriations Committee includes the dollar figures for 
unexpended and unobligated balances in their report and 
occasionally comments on the magnitude. An extensive discussion 
of unobligated balances for RDTE is contained in the 
House Appropriations Committee report on the FY1969 
Appropriations bill [House Report 90-349, pp. 51-52]. 

Because of uncertainty about just exactly when funds will 
be obligated, especially in procurement accounts, and when 
checks will be drawn, it is impossible to say for certain when 
a certain amount of obligational authority will be expended. 
For planning purposes, the Assistant Secretary of Defense 
(Comptroller) has published the following table for relating 
Department of the Navy outlays to TOA during the 1972-1976 
time span. The table is based on data for outlays and TOA 
for the past ten years. Suppose one wanted to compare the 
expenditure rate for military personnel with the expenditure 
rate for shipbuilding and conversion in the years 1972 and 
1973. Based on the table, it was estimated that 99% of the 
1972 TOA would be spent in 1972 while 0.90% would be spent in 
1973. This picture is quite different from the picture in 
the slow expenditure account shipbuilding and conversion where 
it was estimated that 6% of the TOA would be spent in 1972 
while 21% would be spent in 1973. 


(As percent of estimated total payments) 

                                          In           In           In           In           In 
                                          First        Second       Third        Fourth       Fifth 
Appropriation                             Year         Year         Year         Year         Year 

Military Personnel, Navy                  99.00        0.90         [illegible] 
Military Personnel, Marine Corps          96.50        3.30         [illegible] 
Reserve Personnel, Navy                   85.00        14.50        0.50 
Reserve Personnel, Marine Corps           92.00        7.00         1.00 
Operations & Maintenance, Navy            62.50        16.00        [illegible] 
Operations & Maintenance, Marine Corps    81.00        17.50        1.50 
Procurement of Aircraft & Missiles, Navy  16.00        53.00        22.00        [illegible]  2.00 
Procurement, Marine Corps                 10.00        40.00        30.00        12.00        7.00 
Shipbuilding & Conversion, Navy           6.00         21.00        25.00        [illegible]  [illegible] 
Other Procurement, Navy                   26.00        47.00        [illegible]  6.00         4.00 
RDT&E, Navy                               54.00        38.00        6.00         [illegible]  [illegible] 
Military Construction, Navy               7.00         40.00        30.00        13.00        10.00 
  (Dir. Prog. Only) 
Military Construction, Naval Reserve      6.00         34.00        [illegible]  [illegible]  [illegible] 

TABLE I.8 

Note: The above rates relate to gross payments within a fiscal year 
program; they are not net outlay rates which may vary substantially because 
of changes in the volume of reimbursable transactions. The rates 
are representative and are subject to change. 
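
The contrast between a fast and a slow expenditure account can be 
worked through numerically. The sketch below is a hedged illustration: 
it uses only the first- and second-year rates cited in the text for 
Military Personnel, Navy and Shipbuilding & Conversion, Navy, the 
helper names are hypothetical, and the TOA of $1,000 million is an 
arbitrary round number chosen for the example, not a figure from the 
Comptroller's table. 

```python
from itertools import accumulate

# Yearly payment rates (percent of estimated total payments) as cited
# in the text; only the first two years are used in this illustration.
rates = {
    "Military Personnel, Navy": [99.00, 0.90],
    "Shipbuilding & Conversion, Navy": [6.00, 21.00],
}


def cumulative_spendout(yearly_rates):
    """Running total of the percentage of TOA paid out by the end of each year."""
    return list(accumulate(yearly_rates))


def outlays_by_year(toa, yearly_rates):
    """Dollar outlays in each year for a given TOA, using the percentage rates."""
    return [toa * rate / 100.0 for rate in yearly_rates]


# Hypothetical TOA of $1,000 million, assumed purely for illustration.
hypothetical_toa = 1000.0

for account, yearly in rates.items():
    spent = cumulative_spendout(yearly)
    dollars = outlays_by_year(hypothetical_toa, yearly)
    print(f"{account}: cumulative % {spent}, outlays ($M) {dollars}")
```

After two years the personnel account has paid out nearly all of its 
authority, while shipbuilding has paid out only about a quarter, so the 
same cut in TOA affects near-term outlays very differently in the two 
accounts. 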
Now that the terms authorization, appropriation and outlays 
have been defined the role of the Congress in the consideration 
of defense budget requests can be discussed. In the following 
two sections, authorizations and appropriations will be 
discussed respectively. The direct role of the Congress in 
examining and/or limiting outlays has been minimal and will 
not be discussed specifically. Control of outlays has been 
only in the form of setting an expenditure ceiling on the 
entire federal government through the Revenue and Expenditure 
Control Act of 1968. Aside from that one instance, the Congress 
has not legislated on the matter [Maxon, 1972]. 


The Authorization Process 


"No funds may be appropriated after December 31, 
1960 to or for the use of any armed force of the 
United States for the procurement of aircraft, 
missiles or naval vessels unless the appropria- 
tion of such funds has been authorized by legis- 
lation enacted after such date." 
Section 412(b) 
Military Construction Act of 1960 
(Public Law 86-149) 


Prior to the enactment of P.L. 86-149, only Military 
Construction required authorization as well as appropriation 
by the Congress. With the enactment of that law, procurement 
of aircraft, missiles and naval vessels came under authorization 
as well as appropriation scrutiny. Later, other procurement 
accounts and RDTE also were required to have authorizations 
prior to appropriations action.* Currently, only operations 
and maintenance, military personnel and some small procurement 
accounts** may receive appropriations without prior 
authorization. 

As with non-defense agencies, authorizations for the 
Defense Department are first examined by the substantive 
committees, in this case the House and Senate Armed Services 
Committees. Prior to 1961, the House Armed Services Committee 
was characterized as a "real estate committee" [Dexter,1963] since 
its annual review of the Defense Department was confined to 
the Military Construction Authorization. Today, the Armed 
Services Committees annually authorize aircraft, missiles, 
naval vessels, tracked combat vehicles, other weapons, RDTE 
and military construction. An actual authorization contains 
upper limits on appropriations for each of the services in the 
relevant categories, together with any special legislative 
provisions that may be deemed necessary, such as a ban on 
shipbuilding in foreign shipyards. Authorization acts do not 
contain detailed breakdowns of the subcategories under broad 
categories of aircraft, missiles, and so forth which are 
listed above.*** However, if a committee wants funds to be 
cut from a specific program, such as the CH 47 cargo transport 
helicopter, it will generally say so in its committee report, 
which forms part of the legislative history of the authorization. 
The legislative history is used later to determine the 
intent of the law and actually forms as important a part of 
the authorization as the actual statute [Harris,1964,pp. 90-95]. 

*RDTE in fiscal 1963, Tracked Combat Vehicles in fiscal 
[illegible], Other Weapons in fiscal 1970, Torpedoes in fiscal 1971. 

**Other Procurement Army, Other Procurement Navy and Other 
Procurement Air Force and the Army's ammunition account are 
the procurement accounts not requiring authorization. 

***Occasionally, the statute will contain a statement about 
a specific program. This has been the case with the Safeguard 
anti-ballistic missile system. However, such specific language 
is seldom found in the statute itself. 

The authorization process begins with the House and the 
Senate Armed Services Committees' hearings on the defense 
budget requests in the procurement and RDTE areas. The actual 
authorization bill has generally been introduced in each house 
by the chairman of the Armed Services Committee for that house. 
After a committee's hearings are concluded, it marks-up 
(modifies) the bill in executive or closed session and writes 
a report. The committee's bill is then reported to the Floor 
where it is debated and sometimes amended prior to passage. 
If the House and Senate versions of an authorization statute 
are in disagreement, a Conference Committee composed of members 
of the Armed Services Committees meets and works out a 
compromise. There are only two votes in Conference, the House vote 
and the Senate vote. A compromise requires the approval of a 
majority of the House members and a majority of the Senate 
members. The Conference Report summarizes the compromises 
that have been reached by the Conference Committee. Once the 
report is approved by each house the compromise version of the 
bill is sent to the President for his signature. Occasionally 
one house sends its conference representatives to Conference 
with instructions, that is it instructs the representatives 
not to back down on a particular provision. Such action is 
rare, however, since it makes it more difficult to reach a 
compromise [Redford, 1965,pp. 419-420]. 

Some authors have argued that the authorization role of 
the Armed Services Committees is related to an increasing 
concern on the part of the Congress over national security 
policy. Raymond Dawson, in the early 1960's, expressed the 
belief that Section 412(b) opened a new era in Congressional 
oversight of foreign and military policy [Dawson,1963]. A decade 
later, Arnold Kanter argued that Congressional behavior in the 
1960-1970 period was "policy oriented" [Kanter,1972]. Interviews 
of Armed Services Committee staff members indicate that they 
believe that they have developed expertise in certain military 
areas such as tactical air warfare [Berry and Peckham,1973]*. 

On the other hand, other authors have noted very little 
change in the approach of the Congress toward the defense issues 
[Kolodziej,1963, Goss,1970]. Recently, Leslie Korb has argued that 
even recent large cuts in Defense budgets do not reflect a 
policy orientation on the part of the Congress [Korb,1973]. 


*This information is based on interviews by Berry and 
Peckham which were not discussed in their thesis. 


Appropriations 

Appropriations bills originate in the House of 
Representatives. The Constitution specifies that revenue bills must 
originate in the House, and this has been extended by tradition 
to appropriations bills, probably because appropriations and 
revenues were originally considered simultaneously by the 
Congress [Harris,1964,p. 52]. In the defense area, military 
personnel, operations and maintenance, procurement and RDTE 
are all funded by one appropriations bill, the Defense 
Appropriations Bill. Military Construction is funded 
separately.* 

After its introduction in the House, the Defense 
Appropriations Bill is referred to the House Appropriations 
Committee which in turn refers it to the Defense Appropriations 
Subcommittee. The Subcommittee holds hearings, marks-up or 
modifies the bill and reports it to the full committee where 
it is usually reported to the full House without further 
changes [Fenno,1966,p.134]. As with Armed Services Committee 
reports, Appropriations Committee reports form an integral 
part of the legislative history of the appropriation and are 
used to determine the intent of the Congress. Changes to 
appropriations bills on the House Floor are rare. Richard 
Fenno has discussed in considerable detail the reasons for this 
phenomenon [Fenno,1966, pp. 433-434]. When changes do occur, 
they are usually minor. Of 591 non-defense bills in the early 
Sixties examined by Fenno, 517 were passed without amendment 
[Fenno,1966,p.450]. Between 1961 and 1972, seven defense bills were 
passed without amendment, while three were passed with just 
one amendment. Two of those three included amendments 
introduced by George Mahon, Chairman of the Defense 
Appropriations Subcommittee, and were supported by the Subcommittee. 
The other two years, the defense bill was passed after two 
amendments were approved [Congressional Quarterly Almanacs, 
1961 through 1972]. Of the seven amendments approved in this 
time frame, two amendments prohibited the construction of 
ships in foreign shipyards, two amendments prohibited any use 
of appropriated funds for a domestic peace corps, one 
amendment reduced funds for the Army by $10 million since a price 
change had occurred since the Subcommittee had considered the 
budget request, one amendment included funds for a new 
destroyer which had not been included in the Subcommittee's 
version of the bill since Senate authorization of the destroyer 
had not been completed, and one included increases in operations 
and maintenance funds for Army, Navy and Air Force of $50 
million each, an amendment introduced by Mahon. 

*Military Construction, which comprises a very small 
proportion of the defense budget, will not be discussed 
specifically in this analysis. 

After House passage, the Defense Appropriations bill is 
brought to the Senate where it is referred to the Senate 
Appropriations Committee and subsequently to the Defense 
Appropriations Subcommittee. It should be noted, though, that 
the Subcommittee has usually been holding hearings on the 
original House bill for quite some time, even though the bill 
has not yet been formally referred to the Senate. The 
following table gives the average number of days between the 
opening of Senate hearings and the passage of the Defense 
Appropriations Bill in the House for the 1946-1949, 1952-1955 
and 1962-1965 time periods. 

1946-1949    1952-1955    1962-1965 
  +26.2        -48.0        -90.8 

[Note: "+" means bill passed before hearing started.] 

TABLE I.9 

In recent years, the Senate Defense Appropriations 
Subcommittee and Senate Armed Services Committee have been 
holding joint hearings on procurement and RDTE. 

After hearings are concluded and the House passes its 
bill, the Subcommittee marks up the final House bill and 
writes its report. The full Appropriations Committee then 
reports the Subcommittee bill to the Senate floor. Since 
1961, twenty amendments to defense appropriations bills have 
been adopted on the Senate Floor. Some of these amendments, 
especially in recent years, have involved significant changes.* 
Even these totals, however, probably do not adequately capture 
the role of Floor consideration in the Senate. For example, 
in 1970 only one amendment was offered and it was adopted, 
but Congressional Quarterly states, "Many members had prepared 
floor amendments to trim the bill if the committee had not 
done so." [Congressional Quarterly Almanac, 1970, p. 419] 

*For example, in FY1971, a floor amendment provided $500 
million to Israel of which $250 million was for the purchase 
of F-4 Phantom jets [Congressional Quarterly Almanac, 1971, 
p. 341]. 

After Senate passage, a Conference Committee composed of
some or all of the Defense Appropriations Subcommittees'
members meets to iron out differences between the House and
Senate versions of the bill. As with authorization bills,
the Conference reaches a compromise agreement and writes a
Conference Report. After House and Senate approval of the
Conference Report, the compromise bill is sent to the
President for his signature.

Like authorizations, appropriations acts set dollar
limits for broad categories like Navy RDTE and Army
aircraft procurement. Specific programs are seldom, if
ever, mentioned in the actual legislation. The Congress and
the Defense Department know how the cuts in budget requests
are to be allocated (unless the cut is an undistributed or
nonspecific reduction) by the legislative history of the
appropriations act. Most important in this legislative
history are the Committee reports. Study of these reports
yields significant information about the reasons for budget
reductions and the intent of the Congress. For example,
since budget prepreparation took place over 18 months prior
to Congressional review, the Department of Defense itself
will often recommend reductions because of factors such as
schedule slippages.* In the budget execution stage Committee
reports are important sources of information for determining
the legality of switching dollars between different accounts.
*For example, REDEYE missile in 1964 [House Report 1329,
p. 35, April 17, 1964].


F. Budget Execution


The passage of an appropriations act by no means marks
the end of the budget process. In fact the budget execution
stage, which begins with the passage of the act, is one of the
most important of all the budget stages. During budget execu-
tion, funds are apportioned, allocated and allotted by OMB,
OSD, and the Services to those levels of the Defense estab-
lishment which will obligate and spend the appropriated funds.
Also, during this stage the Services frequently seek and obtain
authority from the Secretary of Defense and the Congress for
transferring appropriations from one account to another, a
process called reprogramming. The budget execution stage as
discussed here includes all the activity between the signing
of the appropriations act and the actual expenditure (outlay)
of appropriated funds. Activity in this stage is affected by
myriad rules and regulations; consequently, only selected
portions or elements will be discussed.


History


A traditional budgetary problem has been the problem of
deficiencies; that is, overobligation by an agency and then
presentation to Congress of a "fait accompli." As one member
puts it, the departments "can make these deficiencies and
Congress can refuse to allow them; but after they are made
it is hard to refuse to allow them." In order to address
this problem, Congress passed the Anti-deficiency Act of 1905
which provided for agency apportionment or allotment of
obligational authority, generally on a quarterly basis. It
also provided for penalties for exceeding the apportionments
or allotments [U.S. Code 665(a)-(i)]. Later, in 1921 the
Budget and Accounting Act vested apportionment power in the
Bureau of the Budget (later OMB). To this day, "OMB reflects
Presidential control and can restrict the rate or purpose of
obligations as provided for by law." [Navy Programming Manual,
p. IV-7-1]

In the Department of Defense, the Office of the Secretary 
of Defense becomes very much involved in the apportionment 
process. Apportionment decisions have actually been the joint 
responsibility of the OMB and OSD. During the 1960's impound-
ments (refusal to apportion Congressionally approved obliga-
tional authority) were the result of OSD decisions.*

*The classic example is Secretary McNamara's refusal to
apportion funds appropriated for the B-70 bomber.


The Apportionment Process

The apportionment process resembles the budget preparation
stage of the budget process. As the Navy RDTE manual states,

     "The bureaus, systems commands and offices
     conduct their reviews of apportionment programs
     in much the same manner as budget estimates are
     reviewed. The review and apportionment program
     to OSD is the same as for budget estimates."
     [Navy RDTE Management Guide, p. 5-2].

Preparation of apportionment proposals or requests begins
well before the passage of the appropriations act. The
service Comptroller offices generally obtain all the appor-
tionment requests within one week after the passage of the
appropriations act while the final request is submitted to
OMB within fifteen days after the passage of the act.*
Hearings are held on the apportionment request by the
Services, OMB and OSD [Navy RDTE Management Guide, pp. 5-2
to 5-3].

The reason for all this paperwork is two-fold. First of
all, by the time the apportionment request is acted upon
approximately two years have elapsed since the budget pre-
preparation stage for the fiscal year in question. During
that time the threat may have changed, priorities may have
changed, projects may have fallen behind schedule, costs may
have changed. A second reason is that the Congress has
generally cut the appropriations request (in line item
categories) and those cuts must be applied to individual
programs in the absence of specific Congressional instructions.**

*The apportionment request is submitted on form DD1105.
Final authorization is given on the approved form DD1105 and
on form SD-HEO.

**For example, in 1968, 1969 and 1970 the Congress made
undistributed reductions in the RDTE budgets.


Reprogramming


As the fiscal year progresses, needs may change, some 
projects may fall behind schedule, new projects may arise or 
any number of unforeseen eventualities may surface. The 
device for handling this problem during the budget execution 
stage is called reprogramming. Reprogramming is the shifting
of funds from specific uses originally planned to others 
where these funds can theoretically make a greater contribution 
to organizational or military effectiveness. In other words, 
it is using money for reasons other than those for which it 
was appropriated. Reprogramming in the Defense Department 
plays a very important part in the budget process. However, 
it is not always looked upon favorably by the Congress. For 
example, in 1971 Representative Whitten of the House Defense
Appropriations Subcommittee stated in a hearing on defense 
reprogramming, "We tend to give money for a high priority 
project and then you...come in and say you want to use it for 
a low priority project." [House Appropriations Committee 
Hearings on Defense Appropriations, 1971, Part II, p. 603] 

Reprogramming procedures have been well-defined in the 
Defense Department since the early Sixties [Parker,1973,Ch. 2]. 
Following the passage of the appropriations act, the Services 
submit to the Congress, via OSD, their Base for Reprogramming 
Decisions. This document (DD1414) "identifies the purposes
in terms of budget subactivities of the...appropriation and
the amounts for which funds have been authorized and
appropriated." [Navy RDTE Guide, pp. 5-6] It also reflects
the application of cuts made by the Congress. For example,
the Congress may make an across-the-board 3-percent cut in
operations and maintenance. The Base for Reprogramming
Decisions will show how this reduction is applied to specific
programs.

Suppose the Services desire to reprogram. Specific 
approval is required by SECDEF and the Armed Services and 
Appropriations Committees if the reprogramming involves any 
programs or functions specifically reduced by Congressional 
action or items which the Committees have expressed an interest 
in.* The latter can be determined by a review of the legis-
lative history of the authorization and appropriations bills: 
hearings, committee reports, floor debate. Approval of SECDEF 
and notification of the Congressional Committees is required 
if the reprogramming action involves an increase of $2 million 
or more in any budget subactivity or the addition of a new 
subactivity line item the cumulative cost of which is estimated 
to be $10 million or more over a three year period. Because of 
the way the procedures are set up, the Services obtain maximum 
flexibility the more programs they fund under each line item 
(rather than increasing the number of line items) since 


reprogramming is based on "fences" drawn by the line items.
On the other hand, the Congress increases its control by
drawing more "fences" or requiring more specific breakdowns
of budget estimates.

*A reprogramming request together with its justification
is transmitted on a form DD1415.

Twice each year the Services transmit to the Congress a
"Report on Programs" which summarizes all reprogramming 
approved during a 6 month period, including those actions 
which did not require SECDEF or Congressional approval. The 
Report, called a DD1416, includes columns reflecting the 
original program (appropriations) as approved by Congress that 
was earlier transmitted on the DD1414, all approved reprogram-
ming actions since that time, all reprogramming actions that
were taken which did not require approval and the current
program (as modified by reprogramming).
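
The approval rules described above can be read as a small decision procedure. The following is only an illustrative sketch: the function name, argument names and returned labels are hypothetical, and the dollar thresholds are simply those quoted in the text (an increase of $2 million or more in a budget subactivity, or a new line item estimated at $10 million or more over three years).

```python
def reprogramming_review(increase_millions,
                         congress_reduced_or_interested=False,
                         new_line_item=False,
                         three_year_cost_millions=0.0):
    """Classify a proposed reprogramming action (hypothetical helper).

    Returns one of three labels that follow the rules quoted in the text.
    """
    # Items specifically reduced by the Congress, or in which the Committees
    # have expressed an interest, need specific approval whatever the amount.
    if congress_reduced_or_interested:
        return "SECDEF and Committee approval"
    # An increase of $2 million or more in a budget subactivity, or a new
    # line item costing $10 million or more over three years.
    if increase_millions >= 2.0 or (
            new_line_item and three_year_cost_millions >= 10.0):
        return "SECDEF approval and Committee notification"
    # Everything else is "below threshold" and is only reported later.
    return "below threshold"


print(reprogramming_review(0.5))
print(reprogramming_review(3.0))
```

The ordering of the two checks reflects the text's statement that Congressional interest triggers approval regardless of the dollar amount involved.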

Most reprogramming requests are handled by the chairmen, 
ranking minority members and senior members of the Defense 
Appropriations Subcommittees and the Armed Services Committees. 
As Stephen Horn, a former Senate Appropriations staff member 
has said, "Most junior members of Senate Appropriations are 
unaware of the vast reprogramming responsibilities held by 
their senior colleagues. . .the requests and the disposition 
of them almost never become known to others on the subcommittees 
or full committee." [Horn,1970,p. 194] Occasionally, a reprogramming
request will stir up controversy, such as a reprogramming
request seeking funds for the STEP program in the Sixties.*
Members who are worried about potential reprogramming without
Congressional oversight will often seek an amendment to the
appropriations or authorization bills or at least will try to
generate floor debate which will cover a particular item. A
good example of this is the set of amendments to Defense Appropri-
ations bills introduced by H. R. Gross and passed by the House
in the mid-Sixties which prohibited the Department of Defense
from using appropriated funds to establish a domestic peace
corps. With the passage of these amendments any attempt to
reprogram funds (no matter how small the amount) into a
domestic peace corps program would have required Congressional
Committee approval. However, if Congressman Gross's
amendment were opposed by Committee members, a reprogramming
request which might subvert the purpose of the amendment could
have been approved without being brought to the Floor of the
House or the Senate.

*STEP was a program to train individuals who failed to
pass armed services qualification tests. [Horn,1970,p. 195]

An important question that comes to mind when discussing 
reprogramming concerns its magnitude. How much money is 
reprogrammed yearly? Statistics for DoD are scarce since some 
of the reprogramming documents are classified. During the 
four fiscal years 1961 through 1964, over $8.8 billion in
Defense reprogramming was undertaken for missiles, ships, and
RDTE alone [Report of House Armed Services Committee, July 8,
1965]. The approved procurement and RDTE budgets for these
years totalled approximately $83 billion, so the reprogramming
cited amounted to roughly 11 percent of that total.

A recent study by Commander John T. Parker [Parker,1973,pp.
90-91 and 97-98] includes the following figures for
reprogramming. Unfortunately some of these figures do not
reflect what is termed "below threshold reprogramming,"
that is, reprogramming for which official notification of
SECDEF and/or the Congress is not required.


     FY       Reprogrammed     DOD Budget
     61*      [illegible]      40.28
     62       1.91             46.49
     63       [illegible]      48.35
     64       2.86             49.81
     65       [illegible]      49.25
     66       3.60             [illegible]
     67       5.00             [illegible]
     68**     7.04             [illegible]
     69       [illegible]      [illegible]
     70       2.43             71.29
     71       3.26             68.63
     72       [illegible]      [illegible]

     (Dollars in billions)


Note: *1961-1963 data is only RDTE and Aircraft-Missile
reprogramming.

**1968-1972 data is only above threshold reprogramming.


G. Audit


The final step of the budget process is the audit. The
General Accounting Office undertakes audits for the Congress
while various components within the Defense Department and
the Services undertake them for the executive branch.

The legislative audit undertaken by the General Accounting
Office (GAO) includes not only an examination of accounts for
accuracy and adequacy but also scrutiny of the legal
basis for expenditures in order to ascertain whether or not
they were made in accordance with the letter and intent of
the law. As might be suspected, the intent of the law is
usually determined by examining the legislative history of
the appropriation against which the obligation was charged.
A comprehensive discussion of GAO, its history and its
functions is contained in [Harris,1964, Chapter 6].

GAO was established in 1921 by the Budget and Accounting
Act. Until 1921 auditing of governmental accounts was done
by auditors in the Treasury Department. The Budget and
Accounting Act declared GAO to be independent of the executive
departments. The Comptroller General is appointed by the
President to a fifteen year non-renewable term and can be
removed only by a joint resolution of Congress. During the
first three decades of its existence, GAO was attacked by
executive agencies as being unduly restrictive in its
accounting and financial management requirements. Finally,
the Budget and Accounting Procedures Act of 1950 called for
reform in accounting and auditing methods. Cooperation between
GAO, the Treasury, and the Budget Bureau in the area of accounting
and auditing followed this legislation [Harris,1964].

GAO has the power of disallowance. Consequently, an 
executive officer may be subject to penalties if GAO does 
not agree that an expenditure was within the letter and intent 
of the law. To protect themselves, many executive officers
seek advance GAO rulings on questionable expenditures
[Harris,1964,p. 145]. Also, at times the Congress has granted
exemptions to agencies which state that decisions on the
legality of expenditures by executive officers are final,
which means that GAO cannot disallow these expenditures.
Occasionally certain Defense activities have been exempted,
especially during wartime [See Harris,1964,p. 147, for an example].

In addition to auditing agency books, the General 
Accounting Office has in recent years assumed the role of 
investigator for the Congress. Every year numerous reports 
on the financial management systems of all or part of various 
federal agencies are submitted to the Congress.* Every agency 
is not audited every year, but GAO can decide to audit or 
investigate it at any time. The audit or investigation may 
be authorized by GAO on its own authority, by a Congressional 
Committee or by an individual Congressman. 

The National Security Act Amendments of 1949 established 
the internal audit of defense accounts as a function to be
performed by the offices of the Comptrollers of the Services 
and the Comptroller of the Department of Defense. Since GAO 
does not audit all DoD accounts every year, internal audits 
are both protective and constructive, attempting to detect 
items that might be of interest to GAO and to provide manage- 
ment with information related to economy and efficiency. 

*Recent examples include studies of the DD963 and the COA.

The audit stage is seldom included in discussions of
Defense budgeting. Nevertheless, it is important to realize
that the audit function is performed and that it places some
limits on Service activities; for example, unreported repro-
gramming is likely to be caught by GAO. This in turn means
that the Services have some incentive to try to ensure that
appropriations are granted for the items they believe to be
important. In other words, the Services cannot in general
take an appropriations estimate "off a license plate" and expect
to be able to reprogram with impunity later, given the pre-
viously mentioned reprogramming reporting requirements and
the possibility of GAO audit and disallowance.


H. Summary


In this chapter, the defense budget process has been 
outlined, beginning with the budget prepreparation stage, and 
terminating at the audit stage. This description is a 
necessary preliminary to the more technical analysis contained 
in the later chapters. These chapters will focus on the 
Congressional review stage and will attempt to characterize 
the response of the Congress as a whole to the President's 
budget request for Defense procurement and research and 
development. The discussion of this chapter hopefully will 
have impressed the reader with the numerous factors behind 
and the background of the budget request of the President and 
likewise the numerous elements which affect final Congressional 
action on that request. Although any given year's budget
process may be viewed as a sequence of events, budget execution
for year t-2 may affect the budget preparation for the budget
of year t. Thus any attempt to formulate deterministic
models of even part of the process does not appear to have
much chance of success unless the models contain an exception-
ally large number of variables. The analysis to be undertaken here
abstracts from many of the potential variables that might
affect Congressional action in any one year in order to
formulate, and estimate parameters for, simple probabilistic
models. These models are designed to characterize Congressional
activity on the average and to identify departures from this
routine or usual behavior.



CHAPTER II


II.1. Background


The literature on Congressional budgeting is extensive,
with major contributions dating back to Arthur MacMahon's
classic discussion of Congressional oversight [MacMahon,1943].
Other works include those of Huzar, Carroll, Smithies,
Wallace, and Harris. Since 1960, the two most frequently cited
studies of Congressional budgeting have been those of Wildavsky
[Wildavsky,1964] and Fenno [Fenno,1966]. These two works are
similar in many ways. Both are narrative discussions of
Congressional budgeting based on interviews with Congressmen,
Senators and staff members. Both conclude that, faced with
complex and detailed budget requests, the Congress (appropri-
ations subcommittees in particular) adopts simple approaches
to budget review. These approaches are best characterized as
incrementalism and sampling [Wildavsky,1964,pp. 14-15, and
Fenno,1966,pp. 332-340]. By incrementalism it is meant that the
Congress often makes marginal, percentage-like decisions on budget
requests. [For example, an incremental decision rule might,
as a rule, grant 90 percent of the budget request.] By
sampling it is meant that the Congress will select some item,
often a small, insignificant and familiar one, and examine it
in great detail [Fenno,1966,p. 336]. If the item is justified,
other items will be approved. If not, the item will be cut,
possibly eliminated, and another item will be examined.

As a sequel to Wildavsky's work, Davis, Dempster and 
Wildavsky [Davis,et al.,1966,1967,1971] formulated a number of
simple linear models of budget requests and Congressional
action on budget requests in the non-defense sector. The analysis
to be presented here was partially stimulated by examination
of these models. The Davis, Dempster, Wildavsky (DDW) models
choose as their level of analysis the total agency budget,
that is, the budget of agencies or bureaus such as the Bureau
of Land Management in the Department of the Interior. The
models and data considered here refer to a lower level of
aggregation.

Recently it has been suggested [Kanter,1972] that the Fenno 
and Wildavsky analyses and the DDW models do not apply to 
Defense appropriations. The argument has been made that 
Congressional action in this area is motivated by concern over 
questions of national security [Kanter,1972,p. 138]. Another
author has argued that Congressional action has been motivated
by fiscal or economic concerns [Korb,1973]. On the other hand,
a 1970 Rand study [Stromberg,1970] attempts, with some success, to
apply DDW models to Defense budget data from the years 1953
to 1968.

The purpose of the analysis done here will not be to
reconcile the differences between DDW, Kanter, Korb and
Stromberg. However, it should be pointed out that those
studies are not necessarily totally inconsistent. For example,
it is possible for the Congress in general and the House and
Senate defense appropriations subcommittees in particular to
be making specific cuts in specific procurement programs such
as the F-14 aircraft for fiscal or policy reasons. Yet, at
the same time, it is possible that these subcommittees have
implicit (or possibly explicit) target figures for the
appropriations in the general categories like Procurement
Aircraft and Missiles, Navy (PAMN).

In this chapter and in subsequent chapters, appropriations 
at the level of PAMN will be analyzed using several simple 
models. The data consist of requests and appropriations
for aircraft and missile procurement for the Army, Navy and
Air Force, and research and development (RDTE) requests and
appropriations for the Army, Navy and Air Force. The analysis
was limited to these categories since they are the ones in
which Congressional cuts (or additions) have been most frequent
and of the greatest size [Korb,1973,p. 16].

The purpose of the analysis will be to characterize the 
average or usual results of Congressional appropriations activity 
(at the levels of aggregation reflected in the data), and to 
identify as vividly as possible departures from the usual or 
average results. A number of behavioral explanations exist 
for the characterizations presented in the following chapters, 
including incrementalism, sampling, and policy and fiscal
concerns at the program level. However, the focus of this
analysis is not on the behavioral explanations of the budget
process but rather on the results of the process.*

*This is similar to the focus of a recent study of Atomic
Energy Commission budgets by Natchez and Bupp [Natchez and Bupp,
1973].

The analysis in this chapter begins with a discussion of
the question of how one might construct a model of appropria-
tions as a percentage of budget requests. Next the methodol-
ogy employed by DDW and Stromberg in their studies is reviewed
and the methodology adopted in the data analysis reported
later in the chapter is presented. The data, its sources and
organization, will be explained and results of the analysis
of the data, using a variety of analytical techniques, will
then be reviewed. The final section is a summary of the
main conclusions of the chapter.


II.2. Modelling Appropriations as a Percentage of Request


In any one year, after the Congress appropriates funds,
one can look back on the request and note that the appropria-
tions are some percentage of the request, or

     \[ y_t = \beta_t x_t \]

     where $y_t$ is the appropriation in year $t$,
           $x_t$ is the request in year $t$,
           $\beta_t$ is the percentage, or average, of the
           request that is appropriated in year $t$.

If one is attempting to model the regular or usual results
of Congressional action, a potential approach would be to say
that the percentage is a constant, depending neither on the
request nor the year. In other words, in order to sort out
usual budget outcomes from unusual ones, one might contend
that on the average, the appropriations are a fixed percentage
of the request. However, since such a simplified approach
leaves out numerous intricacies of the budget process which
in any one year may affect the final budget outcome, a
reasonable modification of the above might include a stochastic
error component which in some sense symbolizes other factors
not a part of the simple percentage statement. These other
factors, or errors, can be modelled in several ways. For
example, they can be viewed as being unrelated to the percent-
age, $\beta$, and unrelated to the request. This would lead to a
model of the form:

     \[ y_t = \beta x_t + \epsilon_t \]

     where $\epsilon_t$ is a random disturbance.

On the other hand, one might view the percentage in any given
year as random, with it being the result of the interaction
of a fixed and a random component. This might be modelled
by an equation of the form:

     \[ y_t = \beta x_t e^{u_t} \]

     where $u_t$ is a random disturbance.*

*The exponential form will enable us to use standard
estimating procedures after taking logarithms.

The percentage in a given year might also be viewed as
random but the result of the additive effect of a random
disturbance on a fixed component:

     \[ y_t / x_t = \beta + \epsilon_t \]

Note the difference between $y_t/x_t = \beta + \epsilon_t$ and
$y_t/x_t = \beta e^{u_t}$. The latter equals

     \[ \beta + \beta u_t + \beta u_t^2/2! + \beta u_t^3/3! + \cdots \]

Here the random component (the sum of all the terms after $\beta$)
is related to the size of $\beta$.
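
The distinction between the additive and the exponential form can be checked numerically. The following is a minimal sketch, assuming normally distributed disturbances with a hypothetical standard deviation of 0.05; the function name and the two trial values of the fixed percentage are illustrative and do not come from the data analyzed in this thesis.

```python
import numpy as np


def percentage_spread(beta, u):
    """Return the standard deviation of the yearly percentage under
    the additive form (beta + u) and the exponential form (beta * exp(u))."""
    additive = beta + u                # y_t / x_t = beta + eps_t
    multiplicative = beta * np.exp(u)  # y_t / x_t = beta * exp(u_t)
    return additive.std(), multiplicative.std()


rng = np.random.default_rng(0)
u = rng.normal(0.0, 0.05, size=10_000)  # assumed disturbance distribution

add_lo, mul_lo = percentage_spread(0.5, u)
add_hi, mul_hi = percentage_spread(0.9, u)

print(add_lo, add_hi)  # additive spread does not depend on beta
print(mul_lo, mul_hi)  # exponential spread scales with beta
```

With the same disturbances, the spread of the additive percentage is unaffected by the choice of the fixed component, while the spread of the exponential percentage grows in proportion to it.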


One point which has not been addressed is that another
way of looking at appropriations as a percentage of
requests, with this percentage having some regular or
usual component to it, is to view the percentage as being
dependent on the size of the request; for example:

     \[ y_t / x_t = \beta x_t^{\alpha-1} \]

or

     \[ y_t = (\beta x_t^{\alpha-1}) x_t \]

where $(\beta x_t^{\alpha-1})$ represents the percentage of the request
appropriated. This representation resembles a graduated
income tax scheme (if $0 < \alpha < 1$) in that as $x_t$ gets larger
the percentage of the request appropriated gets smaller.
More specifically, since the elasticity of $\beta x_t^{\alpha-1}$ equals
$(\alpha-1)$, the above equation implies that if the request changes
by $z$ percent, the percentage of the request appropriated
will change by approximately $(\alpha-1)z$ percent.* Again, other
factors discussed earlier may cause the percentage not to be
exactly $\beta x_t^{\alpha-1}$. Thus, a random component might be included
in the model making it look like:

     \[ y_t = (\beta x_t^{\alpha-1}) x_t e^{u_t} \]

     where $u_t$ is a random disturbance.

*The elasticity of $\beta x_t^{\alpha-1}$ is equal to

     \[ \frac{x_t}{\beta x_t^{\alpha-1}} \cdot \frac{d(\beta x_t^{\alpha-1})}{dx_t} = \alpha - 1 \]

In the above paragraphs, four somewhat different ways of
characterizing appropriations outcomes have been discussed.
These were formulated into the following equations:

     (2.1)   $y_t = \beta x_t + \epsilon_t$

     (2.2)   $y_t = \beta x_t e^{u_t}$

     (2.3)   $y_t = \beta x_t + \epsilon_t x_t$

     (2.4)   $y_t = \beta x_t^{\alpha} e^{u_t}$

An additional characterization which is attractive for estimation
purposes is

     (2.5)   $y_t = \beta x_t^{\alpha} e^{u_t \ln x_t}$

These equations are the result of different ways of thinking
about the routine, percentage aspect of Congressional
appropriations. As discussed earlier, they do not imply
that Congressional decision-makers always, or even ever,
think in precise percentage terms (although they might).
Rather, they are different ways of characterizing the usual
end-product or overall results of Congressional action and
may provide insight into what outcomes are unusual or
irregular.
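
Taking logarithms turns the exponential forms into linear ones: (2.2) becomes $\ln(y_t/x_t) = \ln\beta + u_t$ and (2.4) becomes $\ln y_t = \ln\beta + \alpha \ln x_t + u_t$. The following sketch, which uses simulated requests and hypothetical parameter values rather than the actual budget data, illustrates how least squares on the transformed values could recover $\beta$ and $\alpha$ for (2.4). The function name and all numerical settings are assumptions made for illustration.

```python
import numpy as np


def fit_log_model(x, y):
    """Least squares fit of ln y = ln(beta) + alpha * ln x.

    Returns (beta_hat, alpha_hat).
    """
    # np.polyfit returns coefficients from highest degree down:
    # slope (alpha) first, then intercept (ln beta).
    alpha_hat, log_beta_hat = np.polyfit(np.log(x), np.log(y), deg=1)
    return np.exp(log_beta_hat), alpha_hat


rng = np.random.default_rng(1)
x = rng.uniform(0.4, 3.0, size=200)     # simulated requests, $ billions
u = rng.normal(0.0, 0.02, size=x.size)  # simulated disturbances
y = 0.9 * x**0.95 * np.exp(u)           # model (2.4), beta=0.9, alpha=0.95

beta_hat, alpha_hat = fit_log_model(x, y)
print(beta_hat, alpha_hat)
```

Setting $\alpha = 1$ in the simulated data reduces the example to model (2.2), in which case the fitted slope should be close to one.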

Up to this point, nothing has been said about the 
statistical questions involved in estimating $\beta$ (and $\alpha$) from
sets of data. It turns out that the equations just formulated
also appear to be alternatives one might consider when faced
with the problem of estimation and the assumptions that
necessarily underlie the computation of estimates. The
discussion that will follow will concentrate mainly on the
statistical aspects of the problem of characterizing Congres-
sional appropriations as a percentage of requests. The
methodology of DDW and Stromberg will be discussed first.
Then alternative approaches will be suggested and the results
of the application of these approaches discussed.


II.3. Methodology


Prior analyses of budget data by Davis, Dempster, and
Wildavsky and by Stromberg have generally commenced with the
assumption of a simple model for Congressional budget behavior
of the following form [Stromberg,1970,p. 61; DDW,1971,p. 534]:

     \[ y_t = \beta x_t + \epsilon_t \]

     where $y_t$ = appropriations in year $t$
           $x_t$ = request in year $t$
           $\epsilon_t$ is a stochastic error or disturbance term
           assumed to be distributed $N(0,\sigma^2)$ with the
           sequence $\{\epsilon_t\}$ composed of independent and
           identically distributed (i.i.d.) random
           variables.

The constant coefficient $\beta$ is then estimated using least
squares and some criterion like $R^2$, the "coefficient of
determination," is used to judge the adequacy of the "fit"
of the model to the data [DDW,1971,p. 536, Stromberg,1970,pp. 21-
25].* In most cases [DDW,1971,p. 537] the $R^2$ was very high
and thus more complex models employing additional variables
seemed unnecessary. Both DDW and Stromberg attempt to deter-
mine breakpoints (years in which the coefficient $\beta$ shifts) by
performing a hypothesis test originally suggested by Chow [Chow,1960].

*"The principal selection criterion (among alternative
models) is the criterion of the maximum adjusted correlation
coefficient, R" [DDW,1971,p. 536].


II.3.1. Methodological Problems


There are several problems with this methodology. The
difficulties can be classified under three subject headings
labelled problems with (i) the model, (ii) fitting techniques,
centered around the use of least squares, and (iii) goodness
of fit criteria, exemplified by the use of $R^2$ as a criterion
of closeness of fit.
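
For reference, the procedure under discussion (a least squares fit of model (2.1), an $R^2$ summary, and a Chow-type test for a shift in the coefficient) can be sketched as follows. This is an illustration under stated assumptions and not a reproduction of the DDW or Stromberg computations: the function names are hypothetical, the data are simulated, and the centered form of $R^2$ used here is only one of several conventions for a regression without an intercept.

```python
import numpy as np


def fit_through_origin(x, y):
    """Least squares estimate of beta in y = beta * x + eps.

    Returns (beta_hat, residual sum of squares).
    """
    beta_hat = np.sum(x * y) / np.sum(x * x)
    ssr = np.sum((y - beta_hat * x) ** 2)
    return beta_hat, ssr


def r_squared(x, y):
    """Centered R^2 for the through-origin fit (an assumed convention)."""
    _, ssr = fit_through_origin(x, y)
    return 1.0 - ssr / np.sum((y - y.mean()) ** 2)


def chow_f(x, y, split):
    """Chow F statistic for a shift in beta at index `split`.

    With k = 1 estimated parameter per regime, the statistic is compared
    with an F distribution on (1, n - 2) degrees of freedom.
    """
    _, ssr_pooled = fit_through_origin(x, y)
    _, ssr_a = fit_through_origin(x[:split], y[:split])
    _, ssr_b = fit_through_origin(x[split:], y[split:])
    k, n = 1, len(x)
    return ((ssr_pooled - ssr_a - ssr_b) / k) / ((ssr_a + ssr_b) / (n - 2 * k))


# Simulated example: 20 years of requests, coefficient drops after year 12.
rng = np.random.default_rng(3)
x = np.linspace(1.0, 3.0, 20)
beta_true = np.where(np.arange(20) < 12, 0.95, 0.80)
y = beta_true * x + rng.normal(0.0, 0.02, size=20)

beta_hat, _ = fit_through_origin(x, y)
f_stat = chow_f(x, y, split=12)
print(beta_hat, r_squared(x, y), f_stat)
```

In the simulated example the pooled fit can still show a high $R^2$ even though the coefficient shifts, which is one reason a separate breakpoint test is applied.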


(i) Model Specification Problems 


One of the first problems is that the model specified in
(2.1) assumes no interdependence either between $\beta$ and $x$ or
between $\epsilon$ and $x$. As suggested earlier, if Congress were
using a percentage appropriation or percentage cut decision
rule, it may conceivably be the case that the percentage
changes with the size of the request. The problem would not
be noticeable if all requests for a particular agency (in the
case of DDW) or for a particular functional category (in the
case of Stromberg) were of approximately the same magnitude.
In the Stromberg study, however, requests for Procurement
Aircraft and Missiles, Navy, for example, ranged between $380
million in 1953 and $3.06 billion in 1963. Certainly it will
be desirable to let available data speak for itself on this
issue of interdependence between size of request and magnitude
of the percentage appropriated, and on others to be discussed.

Another type of interdependence is that between ε and x.
As will be discussed later and as is well known [Scheffe,1959],
the optimal properties of least squares depend on the sequence
{ε_t} being uncorrelated, identically distributed, and having
var(ε_t) = σ², a constant. Under such conditions the
Gauss-Markov theorem is applicable, guaranteeing efficiency of
least squares estimates among the class of estimators linear
in the observations. Were this condition to hold for
a model of the form of (2.1), then as the
request gets larger the spread of actual data points, or
variance, around the regression line would remain unchanged. Both
DDW and Stromberg view the error term, ε_t, as a random shock
or variation from the usual percentage associated with
"special events and circumstances relevant to particular
years" [DDW,1971,p. 534, and Stromberg,1970,p. 8]. However,
it is likely that the "special circumstances"
that affect a $380 million request neither are the same as, nor
create the same distribution of external effects as, the
circumstances that affect a $3.06 billion request.


(ii) Least Squares Problems

The Gauss-Markov theorem ensures that if the ε_t are
i.i.d. then least squares estimators are best (minimum
variance) linear unbiased estimators. If (2.1) were
the true underlying model, then provided that an
estimate that is linear in the data is desired, the least
squares estimate would appear to be a logical choice.

However, as mentioned previously, one obvious form of
lack of homogeneity of variance of error terms (σ² changing
with x) is certainly possible. In fact, since the regression
line passes through the origin in equation (2.1) and negative
appropriations are not possible, there is a lack of
homogeneity of the error variance built into the model.


Another problem is the presence of cutting or spending
moods on the part of Congress. Perhaps the residuals in the
nineteen fifties are predominantly positive, and those
in the sixties negative. One way DDW try to take this
explicitly into account is by using a classical autoregressive
scheme for the error term, where

      \varepsilon_t = \rho \varepsilon_{t-1} + v_t

and the sequence {v_t} is one of independent
identically distributed random variables.
However, in most cases they were only dealing with 8
observations (at the most 16 and at the fewest 3).

Both DDW and Stromberg attempted to detect changes in the
coefficient β, changes which could be associated with a cutting
or spending mood on the part of the Congress and the public.
DDW ran an F test for each agency on the residuals from the
following regressions [Chow,1960]:

      y_{1t} = \beta_1 x_{1t} + \varepsilon_{1t}

      y_{2t} = \beta_2 x_{2t} + \varepsilon_{2t}

      y_t = \beta x_t + \varepsilon_t          [DDW,1967,p. 278]

where the first equation is for the first n_1 years of data,
the second equation is for the second n_2 years of data and
the third equation is for the pooled (n_1 + n_2 = n years) data.
The value of n_1 was allowed to vary between 3 and n-3 so that
all possible break points were tried. In addition to the
problems with using this approach on data sets
numbering 8 or fewer, there is the added difficulty that the
test assumes the error terms are normally distributed.
Simpler approaches, such as examination of residuals, may
be more informative.
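
The break-point search described above can be sketched in a few lines. This is only an illustration, assuming the standard Chow F statistic applied to one-parameter regressions through the origin; the function names and the toy data are hypothetical and are not taken from DDW or Stromberg, whose exact computations may have differed.

```python
def ssr_through_origin(xs, ys):
    """Residual sum of squares for y = b*x fitted by least squares (no intercept)."""
    b = sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)
    return sum((y - b * x) ** 2 for x, y in zip(xs, ys))


def chow_f(xs, ys, n1, k=1):
    """Chow F statistic for a break after the first n1 observations.

    k is the number of coefficients per regression (1 here: the slope only).
    Under normal errors the statistic is referred to F(k, n - 2k).
    """
    n = len(xs)
    s_pooled = ssr_through_origin(xs, ys)
    s_split = (ssr_through_origin(xs[:n1], ys[:n1])
               + ssr_through_origin(xs[n1:], ys[n1:]))
    return ((s_pooled - s_split) / k) / (s_split / (n - 2 * k))


# Hypothetical data: 8 "years" of requests with small alternating disturbances.
requests = [1, 2, 3, 4, 5, 6, 7, 8]
noise = [0.01, -0.01] * 4
shifted = [(1.0 if t < 4 else 0.8) * x + e
           for t, (x, e) in enumerate(zip(requests, noise))]
stable = [1.0 * x + e for x, e in zip(requests, noise)]

f_shifted = chow_f(requests, shifted, n1=4)  # slope changes after year 4
f_stable = chow_f(requests, stable, n1=4)    # no change in slope

# Try every admissible break point, n1 = 3, ..., n-3, as in the text.
f_by_break = {n1: chow_f(requests, shifted, n1)
              for n1 in range(3, len(requests) - 3 + 1)}
```

With only 8 observations the denominator has very few degrees of freedom, which is one way to see why the text is cautious about applying the test to such short series.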


Stromberg attempted to detect changes in β by analyzing
the model

      y_t = \beta_1 x_{1t} + \beta_2 x_{2t} + \varepsilon_t

where x_{1t} = x_t (the request) for the first n_1 years of data and zero
for the remaining years and x_{2t} = x_t (the request) for the
last n_2 years of data and zero otherwise. He then ran a
t-test for the equality β_1 = β_2. In his formulation the
{ε_t} are assumed to have the same distribution before and
after the shift in the parameter β.

In many cases least squares may be more of a hindrance
than a help to inference and achieving an understanding of
the data. A single outlying observation may seriously
affect the least squares estimate. As an illustration,
consider the situation portrayed in Figure II.1, where the
regression line is forced through the origin.

[Figure: scatterplot with the Least Squares Line]

FIGURE II.1

The upper point in Figure II.1 may be a genuine outlier.
For example, the President may have sent to Congress an
urgent request for more funds. This request, occasioned by
an international crisis, may have been made in March after
the budget was submitted and consequently was not reflected
in the budget request data analyzed. Because least squares
minimizes the sum of squared deviations from the regression line,
the resulting estimate of β in certain situations is apt to
be a meaningless compromise. Least squares will produce a
line with a slope larger than the true β in order to avoid
the one extremely large deviation.
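
A small numerical sketch of this effect follows. The data are hypothetical: five requests appropriated at exactly 95 percent, with one observation then replaced by a much larger appropriation to stand in for a supplemental request not reflected in the budget data. The median of the ratios is shown only for contrast; it anticipates the robust alternatives taken up later and is not part of the DDW or Stromberg procedures.

```python
from statistics import median


def ls_slope_through_origin(xs, ys):
    """Least squares slope for y = b*x: sum(x*y) / sum(x*x)."""
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)


# Hypothetical requests (billions) and appropriations at a true beta of 0.95.
requests = [1.0, 2.0, 3.0, 4.0, 5.0]
appropriations = [0.95 * x for x in requests]

# Replace one observation with an outlying appropriation.
with_outlier = list(appropriations)
with_outlier[2] = 6.0

slope_clean = ls_slope_through_origin(requests, appropriations)
slope_outlier = ls_slope_through_origin(requests, with_outlier)

# Median of the appropriation/request ratios, for comparison.
median_ratio = median(y / x for x, y in zip(requests, with_outlier))
```

In this toy case the single outlier pulls the least squares slope above 1.1, while the median ratio stays at the value shared by the other four points.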

Another situation that might arise is depicted in
Figure II.2, where the regression is not forced through the
origin.

[Figure: scatterplot with the Least Squares Line]

FIGURE II.2

Even if the previously discussed problems are not present
and the error terms are i.i.d., the Gauss-Markov theorem only
ensures that least squares estimators are the best from the
class of estimators that are linear in the observations.
Anscombe [1967] and Huber [1973] have proposed so-called
robust* estimators which are modifications of the least squares
estimators but which are not linear in the observations.
These estimators appear to be especially appealing when
assumptions of equality of the variance of the disturbances
are not satisfied and when long tailed error distributions
are suspected to be in operation.* Robust estimators will
be discussed in more detail in later chapters.


*Anscombe [1967] and Huber [1973] contend that in practice
it is difficult to distinguish between the effects of
heteroscedasticity and long-tailed error distributions.

(iii) The Use and Interpretation of R²

Davis, Dempster and Wildavsky propose the use of R², the
coefficient of determination, adjusted for degrees of freedom,
as the criterion for the closeness of fit of the data to
their models. There are two major problems with this
criterion. One was pointed out by Stromberg; the other has
apparently not been discussed before in the context of the
subject matter being investigated.

The usual computational formula for the estimate of R²
obtained from a sample is:

(2.6)    R^2 = 1 - \frac{\sum_{t=1}^{T} e_t^2}{\sum_{t=1}^{T} (y_t - \bar{y})^2}

where e_t is the t-th residual from the fitted function, y_t
is the t-th observation on the dependent variable, and ȳ is
the sample mean of the dependent variable. Dividing the
numerator and the denominator of equation (2.6) by T, it is
easy to see why R² is usually proposed as a criterion and given
an interpretation of 1 minus the unexplained variance as a
percentage of total variance of the dependent variable, or
the explained variance as a percentage of total variance.
Stromberg [1967, pp. 21-24] has pointed out that when
performing regression with an intercept forced to be equal to zero as
in equation (2.1) the interpretation of Σe_t²/T as the (sample)
unexplained variance is not correct. Regression with an
unconstrained intercept term ensures that ē, the mean residual,
will be identically zero since when the partial derivative
of the function to be minimized (Σe_t²) with respect to the
intercept is set equal to zero, the resulting "normal"
equation automatically sets ē equal to zero. However, when
a zero intercept is assumed, this normal equation is not
a result of the least squares minimization and ē either may
or may not equal zero. Injecting ē into equation (2.6) won't
help since then one could theoretically obtain a large R²
when the average error around the regression line was large
but the spread around the average small.

Stromberg [1967,p. 24] and the BIOMED statistical package
[BIOMED,1965,p. 233] have addressed the problem by computing
a somewhat different number than that in equation (2.6).
They have computed what Stromberg called W²:

(2.7)    W^2 = 1 - \frac{\sum_{t=1}^{T} e_t^2}{\sum_{t=1}^{T} y_t^2}

The interpretation given is that W² gives 1 minus the
unexplained variation about zero as a percentage of the total
variation of the dependent variable about zero.


The problem
here is that when performing regression under the assumption
of a zero intercept, zero appears to be chosen as the point
around which variation is computed more for convenience than
for any other reason. Also, suppose that ē happens to equal
zero or be near zero (which will be the case if regression
with an unconstrained intercept would have yielded a zero or
near zero intercept); then with positive y_t's (which will
always be the case with budget data) W² will be larger than
the corresponding R² value and could be somewhat misleading
to someone used to thinking in terms of R².

Another problem with R² is associated with the question
of its usefulness as a tool in analyzing budget models
similar to (2.1). The model (2.1) states that appropriations
equal a percentage of request, plus some random error term.
There should be no difference between this statement and
the statement that, on the average, Congress cuts a certain
percentage of the request, or

(2.8)    (x_t - y_t) = \gamma x_t + \varepsilon'_t

In fact, if (2.1) is a correct model, then β should equal
(1-γ) and ε'_t should equal -ε_t, since

      (x_t - y_t) = \gamma x_t + \varepsilon'_t

implies

      y_t = (1 - \gamma) x_t - \varepsilon'_t

DDW and Stromberg have always tested equations similar to
equation (2.1) and in general reported that the models were
appropriate since the R² (or W²) values were .98 or .99.
A simple test of equation (2.8) on data which yields a
.99 R² for equation (2.1) will reveal a much reduced R².
(For example, data which yielded a .98 R² for equation (2.1)
achieved only a .26 R² for equation (2.8).)
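
The sensitivity of R² to the way the dependent variable is defined can be reproduced on a small made-up data set. The sketch below fits the appropriations form (2.1) and the cut form (2.8) through the origin and computes R² as in (2.6) and W² as in (2.7). The requests and cuts are hypothetical numbers chosen only for illustration, and the function names are not from any of the cited studies.

```python
def fit_through_origin(xs, ys):
    """Return the least squares slope for y = b*x and the residuals."""
    b = sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)
    return b, [y - b * x for x, y in zip(xs, ys)]


def r_squared(ys, residuals):
    """Equation (2.6): 1 - sum(e^2) / sum((y - ybar)^2)."""
    y_bar = sum(ys) / len(ys)
    return 1 - sum(e * e for e in residuals) / sum((y - y_bar) ** 2 for y in ys)


def w_squared(ys, residuals):
    """Equation (2.7): 1 - sum(e^2) / sum(y^2), variation taken about zero."""
    return 1 - sum(e * e for e in residuals) / sum(y * y for y in ys)


# Hypothetical requests and cuts (millions of dollars).
requests = [1000, 1500, 2000, 2500, 3000, 3500, 4000, 5000]
cuts = [40, 10, 90, 30, 120, 60, 200, 100]
appropriations = [x - c for x, c in zip(requests, cuts)]

beta, e_approp = fit_through_origin(requests, appropriations)  # model (2.1)
gamma, e_cut = fit_through_origin(requests, cuts)              # model (2.8)

r2_approp = r_squared(appropriations, e_approp)
r2_cut = r_squared(cuts, e_cut)
w2_approp = w_squared(appropriations, e_approp)
```

The two fits have residuals of identical size (they differ only in sign) and slopes that sum to one, yet the R² values differ sharply because the denominators measure the variation of different dependent variables.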

The point of the discussion is that the value for the
measure of fit, R², is going to be sensitive to the way the
model for appropriations is formulated because changes in
location of the dependent variable alter the distribution
of R². As a result, one should probably not use R² as an
absolute measure of fit and should instead consider it in
a probabilistic context.


II.3.2. A Description of the Methodology Employed

Methodological problems with analysis of appropriations
behavior were divided into model specification, least squares,
and the use of R² as a criterion of closeness of fit of the
data to the model. The methodology of the analysis of this
chapter was designed to address these problems.


Plotting

The first approach to the data involved constructing
scatterplots of appropriations and requests. Scatterplots
graphically indicate the range of the data and the character
of the relationship between the variables, whether it be
linear or curvilinear. Scatterplots also reveal potential
outliers. Stem and leaf plots* of the appropriations and
the requests were also constructed to display the range and
variation of the data. In addition to being revealing,
scatterplots and stem and leaf plots are relatively easy
to construct, for example using the SNAP/IEDA computer
package from Princeton University.

Plots of the logarithms of the data were also analyzed.
These plots were useful for consideration of the models
(2.3) through (2.5).


Alternative Models and Estimation Procedures

Five alternative models were chosen as potential
representations of appropriations as a percentage of requests.
These were the models (2.1) through (2.5) discussed in
section II.3. It should be noted that the models, except
for (2.1), are proposed in order to take into account
systematic interdependence between the request, x_t, and the
percentage appropriated, the error component, or both. They
are not designed to take into account cutting or spending
moods, or eras in the mood of Congress.

*Stem and leaf plots are discussed in Appendix A.

Our attitude towards the models is that (a) they cannot
be universally or perpetually valid, since one can specify
representations of more complex behavior on the part of Congress,
but (b) such simple models do aid in understanding the
phenomena underlying the data. In particular, estimation of
model coefficients by various techniques, and subsequent
examination of the residuals around the fitted relationship,
is a procedure that brings to attention certain historical
periods and possibly certain programs that differ from the
average or "usual." The fact that such periods (or eras) and
programs exist should give pause to those who would attempt
to mechanistically predict future appropriations behavior on
the basis of simple models fitted to historical data. On the
other hand, simple fitted models can provide a starting point
for future discussions of amounts to be appropriated [Senate
Report 93-688].

The simplest model of appropriations behavior, that given
by equation (2.1), was estimated using ordinary least squares.
The residuals were analyzed using scatterplots, and stem and
leaf plots of both the residuals themselves and their absolute
value.

The model specified in equation (2.2) was analyzed in two
ways. The least squares estimator for β is the mean value
of the ratio y_t/x_t. However, the median y_t/x_t ratio was
also utilized in order to provide a robust estimator of β
[Andrews, et al.,1972], and to allow outliers to reveal
themselves more clearly. The residuals for both estimates
were compared on the basis of scatterplots and stem and leaf
plots, both of the residuals themselves and of their absolute
value.

Equation (2.3) specified a percentage type model, in some
ways similar to equation (2.1), except that the error term is
multiplicative. It is also similar to equation (2.4) except
that the percentage of the request that is appropriated does
not change with the size of the request. The coefficient β
was estimated in two ways. The first method, which yielded
the least squares estimate, was to use the mean difference
ln y_t - ln x_t as the estimate of ln β. The second approach
was to use the median difference. Residuals for both
estimates were compared.
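
The four estimates just described for (2.2) and (2.3) can be written out directly. The following is a minimal sketch on hypothetical data (four items appropriated at 98 percent of the request and one item at twice the request); the function names are illustrative only.

```python
import math
from statistics import mean, median


def ratio_estimates(xs, ys):
    """Model (2.2): mean (least squares) and median of the ratios y/x."""
    ratios = [y / x for x, y in zip(xs, ys)]
    return mean(ratios), median(ratios)


def log_difference_estimates(xs, ys):
    """Model (2.3): exponential of the mean and of the median of ln y - ln x."""
    diffs = [math.log(y) - math.log(x) for x, y in zip(xs, ys)]
    return math.exp(mean(diffs)), math.exp(median(diffs))


# Hypothetical requests and appropriations (millions of dollars).
requests = [100.0, 200.0, 400.0, 800.0, 1000.0]
appropriations = [98.0, 196.0, 392.0, 784.0, 2000.0]  # last item is an outlier

mean_ratio, median_ratio = ratio_estimates(requests, appropriations)
mean_log, median_log = log_difference_estimates(requests, appropriations)
```

On these numbers both median-based estimates stay at .98, while the mean ratio rises to 1.184; the exponentiated mean log difference (a geometric mean of the ratios) is pulled up as well, though by less.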

The model specified in equation (2.4) is linear in the
logs. Consequently logarithms were taken of both sides of
(2.4) and the coefficients α and ln β were estimated using
least squares. Residuals were analyzed as in the case of the
other models.

If one takes logarithms of both sides of (2.5) the
resulting equation is:

(2.9)    \ln y_t = \ln \beta + \alpha \ln x_t + u_t \ln x_t

As suggested in [Johnston,1963,p. 211], estimates of α
and ln β were computed by applying least squares after
dividing both sides of equation (2.9) by ln x_t.*

*Since none of the values for the x_t's were equal to 1.0,
division by ln x_t was always possible.
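
Dividing (2.9) through by ln x_t turns it into an ordinary regression of ln y_t / ln x_t on 1 / ln x_t, with intercept α and slope ln β. The sketch below assumes that reading of (2.9) and checks it on noise-free hypothetical data; the helper names and the parameter values are illustrative and do not come from the thesis.

```python
import math


def simple_ols(us, vs):
    """Least squares fit of v = a + b*u; returns (a, b)."""
    n = len(us)
    u_bar = sum(us) / n
    v_bar = sum(vs) / n
    b = (sum((u - u_bar) * (v - v_bar) for u, v in zip(us, vs))
         / sum((u - u_bar) ** 2 for u in us))
    return v_bar - b * u_bar, b


def fit_after_dividing_by_log_request(xs, ys):
    """Regress ln y / ln x on 1 / ln x; requires every x to differ from 1."""
    us = [1 / math.log(x) for x in xs]
    vs = [math.log(y) / math.log(x) for x, y in zip(xs, ys)]
    alpha, ln_beta = simple_ols(us, vs)  # intercept is alpha, slope is ln(beta)
    return alpha, math.exp(ln_beta)


# Hypothetical, noise-free data generated from y = beta * x**alpha.
true_alpha, true_beta = 0.95, 1.4
requests = [150.0, 400.0, 900.0, 2500.0, 6000.0]
appropriations = [true_beta * x ** true_alpha for x in requests]

alpha_hat, beta_hat = fit_after_dividing_by_log_request(requests, appropriations)
```

Because the transformed error is u_t itself, the division is what makes the disturbances comparable in size across small and large requests.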
In the cases of (2.3) - (2.5) it should be obvious that
β was not estimated directly; rather, estimates of ln β were
derived. The exponential of this estimate was used as the
estimate of β.

It should be pointed out that the goals of estimation at
this point in the analysis are exploratory and tentative.
One question to be answered was: which methods tolerate and
reveal outliers more effectively, the log transformed
data or the basic numbers, mean estimators or median estimators?
Another question to be answered was whether or not any of the
approaches (analysis of the logs, use of medians or means)
revealed cutting or spending moods.

II.4. Data


Two basic batches of data were analyzed. The first were
procurement requests and appropriations for the years
1953-1973. The following categories were analyzed as a group:

    Procurement Equipment and Missiles, Army (PEMA)
    Procurement Aircraft and Missiles, Navy (PAMN)
    Procurement Aircraft, Air Force (AF A/C)
    Procurement Missiles, Air Force (AF Missiles)

Twenty one years of data and 4 observations per year should
yield 84 total observations. However, from 1955-1958 either
PEMA requests or appropriations or both were zero and
consequently were not included in the data. Thus, to start
with, 80 observations were analyzed. The above four categories
constitute roughly three quarters of annual Defense Department
procurement requests and appropriations. The only major
category not included which is comparable in size to these
is Shipbuilding and Conversion, Navy. The high and low
appropriations figures for each category were (in billions):

Category    High          Year          Low           Year
PEMA        $5.462        1968          .971          1960
PAMN        [illegible]   [illegible]   [illegible]   1953
APAF        8.048         [illegible]   2.072         1955
MPAF        2.903         [illegible]   .796          1966

TABLE II.1

HIGH AND LOW APPROPRIATIONS FOR PROCUREMENT
APPROPRIATIONS IN THE PERIOD 1953-1973

The data from the years 1953-1968 were taken from
Stromberg [Appendices A and B]. Stromberg took the
appropriations categories of 1968 as given; he then traced all
category changes back to 1953 (there were few in the
procurement area) and thus reconciled the data with the 1968
categories. The 1969-1973 data were reconciled with the 1968
categories (there were several changes in the Army procurement
categories in 1972) by the author. The basic data were
obtained from [U.S. Senate, Budget Estimates and
Appropriations, 1969-1973].

The second batch of data analyzed was the Research and
Development (RDTE) requests and appropriations for 1953-1973
for the Army, Navy and Air Force. In all there were 63
observations. The high and low appropriations figures for
each category were (in billions):

Category          High      Year          Low           Year
RDTE Army         $1.839    1972          [illegible]   1956
RDTE Navy         2.545     [illegible]   .059          1954
RDTE Air Force    3.632     1963          .418          1955

TABLE II.2

HIGH AND LOW APPROPRIATIONS FOR RDTE
IN THE YEARS 1953-1973

The data sources for RDTE were the same as for the Procurement
data. A listing of the data can be found in Appendix D of
this chapter.


II.5. Results

Analysis of the procurement and RDTE data using the
approaches discussed earlier yielded some notable differences
in results depending on which approach was used. These
differences can be summarized under the categories of
differences in coefficients, residuals and outliers, and closeness
of "estimated" appropriations to actual appropriations, i.e.
predictability. Each of these topics will be taken up in turn.

II.5.1. Coefficients

The three models (2.1) - (2.3) all say that
on the average, appropriations are a constant percentage of
the request. The models differ in the way the random
component enters their specification. The estimated
coefficients using the different approaches for procurement
and RDTE data are presented in Table II.3.

Model             Procurement    RDTE
(2.1)             .959           .982
(2.2) median      .974           .989
      mean        .993           1.09
(2.3) median      .977           .990
      mean        .973           1.030

TABLE II.3

ESTIMATED PERCENTAGE OF REQUEST
APPROPRIATED USING (2.1) - (2.3)


Although it is difficult to assess the differences among the
various "estimated" percentages without some measures of
stability or variability of the estimates or without examining
how close the estimated appropriations are to actual
appropriations both for the set of data analyzed and for an independent
set, some differences stand out immediately upon examination
of Table II.3. The least squares estimates for the percentages
of (2.1) and (2.2), that is .959 and .993, for procurement are
different from the other estimated percentages which are
about .975. Confidence intervals computed in Chapter IV will
allow us to assess the importance of such differences.

A second interesting aspect of the estimated coefficients
is that for the RDTE data the least squares estimates for
(2.2) and (2.3) are greater than 1.0, being 1.09 and 1.035
respectively. In other words, according to these results the
Congress has increased the appropriations over and above the
request by some percentage figure. This result is somewhat
at odds with the current view of Congressional activity in
the RDTE area, that is, the view that the Congress is carefully
scrutinizing and cutting RDTE budget requests. These and
other points will be pursued further in the discussion of
outliers and closeness of "fit."

The two models (2.4) and (2.5) both say that the percentage
of the request which is granted depends on the size of the
request. The difference between the two is in the
specification of the error component. The estimated coefficients for
both (2.4) and (2.5) are presented in Table II.4.


Model       Procurement               RDTE
            α̂        ln β̂           α̂        ln β̂
(2.4)       .948     .384            .903     .702 (?)
(2.5)       .969     [illegible]     .852     1.056 (?)

TABLE II.4

ESTIMATED COEFFICIENTS FOR (2.4) AND (2.5)

The coefficients themselves are not immediately informative.
More informative are the percentages they imply. For example,
what percentage of the request do the estimated coefficients
imply would have been granted for a request of 3 billion
dollars? This can be assessed by computing

      \hat{\beta} x_t^{\hat{\alpha} - 1}

where β̂ and α̂ are the estimated coefficients and x_t is, in this
case, 3.0. The following table gives the average, maximum
and minimum percentage of the request that the estimated
coefficients imply would have been granted for the procurement
and RDTE data that were analyzed. This provides a method of
comparison of the estimated percentages from the approaches
(2.1) - (2.3) with the results of the estimation of
coefficients for (2.4) and (2.5).

Model     Procurement                             RDTE
          High    Average       Low               High          Average       Low
(2.4)     1.14    .981          .91               [illegible]   [illegible]   [illegible]
(2.5)     1.06    [illegible]   [illegible]       [illegible]   [illegible]   [illegible]

TABLE II.5

ESTIMATED PERCENTAGE OF REQUEST GRANTED
FOR DATA ANALYZED

Appendix C to this chapter contains stem and leaf plots for
the percentages for procurement and RDTE using the approaches
of (2.4) and (2.5).

An interesting question is: How does the "estimated"
percentage of the request granted change with changes in the
size of the request? As the request gets larger does the
percentage granted increase or decrease? (Of course the
approaches in (2.1) - (2.3) assume the percentage remains
the same.) This sensitivity of the percentage granted to
the size of the request can be investigated using the
elasticity of the percentage with respect to the request, or

      \frac{\partial (y_t / x_t)}{\partial x_t} \cdot \frac{x_t}{y_t / x_t}

The following table gives the "estimated"
elasticities for (2.4) and (2.5). Note that these elasticities
can be obtained by computing (α̂ - 1).

Model     Procurement    RDTE
(2.4)     -.052          -.097
(2.5)     -.031          -.148

TABLE II.6

ELASTICITY OF PERCENTAGE OF REQUEST
GRANTED WITH RESPECT TO SIZE OF REQUEST

What these elasticities say is that for a 100 percent increase
in the request there is an estimated 5.2 percent decrease in
the percentage of the request granted (in the case of
procurement, with Model (2.4)).
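
An elasticity is, strictly, a statement about small proportional changes. The sketch below assumes the power-law form implied by (2.4), in which the share granted is β x^(α-1), and compares the elasticity with the exact change in the share when the request doubles. The value of α is the procurement estimate for (2.4); the value of ln β and the request sizes are placeholders, since they cancel in the ratio.

```python
import math


def implied_share(x, alpha, ln_beta):
    """Share of the request granted under y = beta * x**alpha."""
    return math.exp(ln_beta) * x ** (alpha - 1)


def share_elasticity(alpha):
    """Elasticity of the share with respect to the request: alpha - 1."""
    return alpha - 1


def share_ratio_when_request_scales(alpha, factor):
    """Exact ratio of new share to old share when the request is multiplied by factor."""
    return factor ** (alpha - 1)


alpha_procurement = 0.948   # implies the -.052 elasticity reported for (2.4)
ln_beta_placeholder = 0.4   # hypothetical; cancels out of the ratio below

elasticity = share_elasticity(alpha_procurement)
doubling = share_ratio_when_request_scales(alpha_procurement, 2.0)
check = (implied_share(2000.0, alpha_procurement, ln_beta_placeholder)
         / implied_share(1000.0, alpha_procurement, ln_beta_placeholder))
```

Under this assumed form a doubling of the request multiplies the share by 2^(-.052), about .965, so the "5.2 percent" reading is best taken as a first-order approximation.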


II.5.2. Residuals

As discussed in the methodology section, for each approach
residuals were calculated and these residuals were then
studied. An initial study of the residuals was undertaken
(i) by making stem and leaf plots, and (ii) by making
scatterplots of the residuals versus the request for (2.1)-(2.2) or
scatterplots of the residuals and logs of requests for (2.3)-(2.5).

The residuals studied were the results of the following
arithmetic computations for each approach.

(2.1')    y_t - \hat{\beta} x_t

(2.2')    \frac{y_t}{x_t} - \hat{\beta}

(2.3')    \ln y_t - \ln x_t - \ln \hat{\beta}

(2.4')    \ln y_t - \hat{\alpha} \ln x_t - \ln \hat{\beta}

(2.5')    [illegible in source]

These stem and leaf plots and scatterplots are
contained in Appendix B of this chapter. The results for the
procurement data will be discussed first, followed by a
discussion of the results for the RDTE data.


Discussion of the Procurement Analysis:
Residuals and Outliers

For each model, as applied to the procurement data, the
following items or data points produced residuals whose
absolute values were more than twice the mean absolute
residual for that model. This provides a preliminary
indication of outliers.

[Table II.7 cannot be reconstructed from the source. It listed
items under the headings Negative and Positive. Legible entries
include 1954 (several items), AF A/C, PAMN, PEMA (several items),
1960 AF Missiles and 1963 PAMN.]

TABLE II.7

ITEMS PRODUCING LARGE RESIDUALS
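
The screening rule used for this table (absolute residual greater than twice the mean absolute residual) is simple enough to state as code. The sketch below is illustrative only; the item labels and residual values are made up and do not reproduce any entry of Table II.7.

```python
def flag_large_residuals(labels, residuals, multiple=2.0):
    """Split items whose |residual| exceeds multiple * mean |residual| by sign."""
    mean_abs = sum(abs(r) for r in residuals) / len(residuals)
    threshold = multiple * mean_abs
    negative = [lab for lab, r in zip(labels, residuals) if r < -threshold]
    positive = [lab for lab, r in zip(labels, residuals) if r > threshold]
    return negative, positive


# Hypothetical items and residuals (millions of dollars).
labels = ["item A", "item B", "item C", "item D", "item E", "item F"]
residuals = [-10.0, 250.0, 15.0, -20.0, -180.0, 5.0]

negative_items, positive_items = flag_large_residuals(labels, residuals)
```

Here the mean absolute residual is 80, so the cutoff is 160 and only two items are flagged. Because the outliers themselves enter the mean, a few very large residuals raise the cutoff, which is one reason the rule is described as preliminary.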

These results reveal some interesting aspects of the
phenomena giving rise to the data. First, 1954 probably is
not part of the same population as the rest of the data.
Appropriations for fiscal 1954 were approved shortly after
the signing of the Korean truce and appear to be affected by
this event. (The high positive residual for 1954 PEMA,
representing a Congressional appropriation of over $2 billion
after the Army requested only $1 billion, either represents
a desire to replenish Army stocks at the close of the war or
possibly is related to the very large cut of the 1953 PEMA
request, a cut of almost $700 million.) Until 1960, PEMA
is a very unstable category. In 1957 and 1958, during the era
when "massive retaliation" and "brinksmanship" were the
popular foreign policy doctrines [Crecine and Fischer,1971,pp.
25-26], there were neither requests nor appropriations for
PEMA. The large positive residuals for fiscal 1962 PEMA and
fiscal 1963 PAMN reflect the increasing emphasis on
conventional weaponry in the early 1960's and may very well be
related to Congressional concern over the unsuccessful Bay
of Pigs invasion of April 1961 and the growth of the Military
Assistance Command, Vietnam.

A second interesting aspect of analysis of the residuals
is that almost all post-1969 items yield negative residuals;
also, these are of approximately equal magnitude. In other words,
the various simple model approaches predict recent
appropriations to be higher than they actually were. For example,
using the approach of (2.1) on procurement data, thirteen of
the twenty most negative residuals were for post-1969 items.
This grouping of the post-1969 items suggests that decision
rules changed in 1969 and the Congress began cutting a larger
percentage of the request. In other words, there is some
evidence that an era terminated and the mood of Congress
changed. Since the data analyzed represent total Congressional
activity, this analysis does not pinpoint whether the change
was associated with any special Congressional Committee. Other
research [Laurance,1973] indicates that the Senate Armed Services
Committee began to closely scrutinize defense authorization
requests at this time. The only peculiar aspect of the fiscal
1969 budget picture was that since it was acted on in calendar
1968, it was an election year budget. Also, fiscal 1968
represents the low point in President Johnson's popularity and
a low point in public support for the Vietnam war [Gallup,1968].

One notable difference in the residuals for the different
approaches is associated with the pre-1960 items of less than
$2 billion. The analysis of the logarithms, the approaches of
(2.3) through (2.5), grouped together a number of
pre-1960 items that were under $2 billion, several being under
$1 billion, as the most negative residuals. The other
approaches, which used the original untransformed numbers,
did not group these residuals together. For example, under
the approach of (2.4) 1953 PAMN with a request of $124.5
million and appropriations of $113 million produced the 5th
most negative residual. Under the approach in (2.1), the
residual was slightly negative and ranked 33rd (of 80) among
residuals ordered from most negative to most positive. A
similar result occurred for 1956 PAMN with a request of
$945 million and appropriations of $804 million.

The particular phenomenon observed here resembles one
discussed by Daniel and Wood [1971,p. 25] as part of a general
discussion of the distribution of data points. The plot of
requests versus appropriations looks something like Figure II.3.

[Figure: plot of requests versus appropriations]

FIGURE II.3

As Daniel and Wood point out, if least squares is used
on data similar to that in Figure II.3, the cluster at the
right acts as essentially one point in determining the slope.
The slope is, thus, highly dependent on the extreme values
at $100 and $800 million. Thus, the residuals for those
items will tend to be small. If the approach of (2.3) is
used and the median difference between ln y_t and ln x_t is
employed as an estimator for ln β, then all of the points
have equivalent importance in determining the estimate.

In the discussion of coefficients it was pointed out that
the .959 coefficient obtained by using least squares on (2.1)
was somewhat different from the coefficients produced by the
other approaches. In the stem and leaf plot of the residuals
(p. B-[illegible]) note the large number of residuals in the 0-300
range. There are thirty-three negative residuals and
forty-seven positive ones. The other approaches yield
distributions of residuals which are somewhat more symmetrical
around zero. For example note the stem and leaf plot (p. B-15)
for the approach of (2.4). There are forty-two negative and
thirty-eight positive residuals. The pattern of the residuals
and the estimated coefficients indicate that when least
squares was applied to (2.1) a few large cuts, that is
appropriations significantly smaller than requests, pulled
the estimated line down slightly; hence the smaller slope
and different pattern of residuals. In later chapters
we will explore estimation procedures which are not as
sensitive as least squares to a few extreme observations.
Discussion of the RDTE Analysis:
Residuals and Outliers

As was true in the case of procurement, the residuals
produced by applying each of the various approaches to
RDTE data were examined in detail. The items which produced
the largest negative residuals under each of the approaches
were 1954 Army, Navy and Air Force. It should be noted that
for the approach of equation (2.2), regardless of whether
the mean or median of the ratio of appropriations to request
was used, the result failed to yield a negative residual
that was larger in absolute value than twice the mean of
the absolute value of residuals.

The largest positive residuals under each of the
approaches were produced by 1955 Navy, 1962 Air Force and
1960 Air Force.

The procurement data showed a marked tendency for post-1969
items to have negative residuals for each model-fitting
approach. However, the RDTE data do not yield such definitive
results for all approaches. The approaches of (2.1) through (2.3),
especially (2.1), show this tendency; in each case,
after the residuals were ordered from most negative to most
positive, 13 of 15 post-1969 items yielded residuals in the
lower half of the ordered set. However, this was not true
for the variable percentage approaches of (2.4) and (2.5).
Under these approaches the most negative residuals were
recorded for the pre-1960 items, where the request and cut
were both small in absolute terms but the cut was large when
considered relative to the request. For example, under the
approach of (2.5) the seventeen most negative residuals were
for items less than $1 billion, prior to 1960.

Insight can be gained into this pattern for the approaches
of (2.4) and (2.5) by recalling the elasticities presented
in Table II.6. Under the approach of (2.5) the estimated elasticity
of the percentage of the request granted with respect to the
size of the request was -.148. Consequently, the percentage
of a 500 million dollar request that is appropriated was
estimated to be .148 larger than the percentage of a 1 billion
dollar request that is appropriated. The RDTE data varied
between requests of $54 million and $3 billion. Any items
for which the request was less than $1 billion and
which did not show appropriations greater than the request
yielded large negative residuals when the approaches of (2.4)
and (2.5) were used.
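
The effect of the elasticity can be illustrated with a short calculation. The sketch below assumes that the fitted percentage granted is proportional to the request raised to the elasticity (a constant-elasticity reading of the variable percentage approaches); the function name is hypothetical and is used only for illustration. On that assumption, halving the request raises the fitted percentage by a factor of about 1.108, so the figure of .148 quoted above may be read as a rough approximation.

```python
def percentage_ratio(small_request, large_request, elasticity):
    """Ratio of the fitted percentage granted at small_request to that at
    large_request, assuming the percentage is proportional to
    request ** elasticity (constant elasticity; an assumption of this sketch)."""
    return (small_request / large_request) ** elasticity


# Requests in millions of dollars; -0.148 is the elasticity quoted in the text.
ratio = percentage_ratio(500.0, 1000.0, -0.148)
print(round(ratio, 3))  # 1.108
```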

One feature of the analysis of RDTE residuals which did
not appear in the analysis of the procurement data was the
almost exclusive domination of the extreme positive residuals
by the Air Force. No matter which approach was used, the
residuals for the Air Force were among the most positive.
Re-examination of the data reveals that except for small
reductions in 1966 and 1968, between the years 1957 and 1969
the Air Force either received almost exactly what it requested or,
more often, received more than it requested. This pattern
is not present for the Army and Navy. This leads one to
suspect that the Air Force data for the 1957-1969 period may
be part of a population different from the Army and Navy data
of the same period. On the other hand, the 1970-1973 data
reveal no notable differences between Congressional action
on Air Force RDTE requests and Congressional action on Army
and Navy requests.

A final point of interest is associated with the distribution
of residuals under the various approaches. The
stem and leaf plots in Appendix B show two interesting
features. First, once again the approach of (2.1) yields
many small positive residuals. Note the large number of
residuals in the 0 to 150 range. This is very similar to the
pattern that appeared in the procurement data. The second
interesting feature is associated with the difference in the
distribution of residuals for the mean and median when used
as estimators for the coefficients in the approaches of (2.2)
and (2.3). Recall that the mean and median yielded
different estimated coefficients (1.09 to .99 for (2.2) and
1.03 to .99 for (2.3)). Next, note that when the mean is used
as the estimator almost all of the residuals are negative,
small in absolute value but negative nevertheless. In fact,
when the mean is used as the estimator for the coefficient
for the approach of (2.2), 57 of 63 residuals are negative.
When the median is used exactly half are negative and half
are positive, with one residual being zero.*

Closer examination of the data indicates that at least
one apparent outlier, 1955 Navy, where the request was $61
million and appropriations were $419 million, is significantly
affecting the least squares estimate (the mean) both for
the approach of (2.2) and the approach of (2.3).

The scatterplots of the residuals versus the predicted
values and of the dependent variable versus the independent
variable (see Appendix B) for approaches (2.4) and (2.5) are
also revealing. In both cases, for small predicted values
the residuals tend to be negative while for large predicted
values they tend to be positive.

A careful examination of the $\log x_t$ versus $\log y_t$ scatterplot
for the approach of (2.4) on page B-33, however, reveals that
there is one extremely large positive residual associated with
a very small value for the independent variable. That is, the
scatterplot and the estimated regression line look something
like Figure II.4.

*For the approach of (2.3) 48 of 63 residuals are negative
when the mean is used as estimator while half are negative and
half are positive when the median is used.

FIGURE II.4

This data point was, again, 1955 Navy RDTE for which the
request was $61 million and appropriations were $419 million.
It appears that this outlier, which is affecting the least
squares estimates for (2.2) and (2.3), is also affecting the
least squares results for the approach of (2.4).


II.5.3. Predictability

The numerical values of the residuals under each of the
approaches are, of course, not directly comparable. For
example, the residuals from the approach of (2.1) are found
by computing, for each t,

    $y_t - \hat{\beta} x_t$

while the residuals obtained from the approach of (2.2) are
found by computing

    $\frac{y_t}{x_t} - \hat{\beta}$

where the $\hat{\beta}$'s were estimated using the techniques discussed
earlier.

In the last section the distributions of the residuals
under each of the approaches were compared, as were the items
associated with extreme residuals under each approach. But
the numerical values of the residuals were not compared.

Some assessment of the proximity of what might be called
the predicted appropriation under each approach, $\hat{\beta} x_t$ for
(2.1) - (2.3) and $\hat{\beta} x_t^{\hat{\alpha}}$ for (2.4) - (2.5), and the actual
appropriation (i.e. $y_t$) is useful and informative. Consequently,
the difference $y_t - \hat{\beta} x_t$ (or $y_t - \hat{\beta} x_t^{\hat{\alpha}}$) was computed for each
item under each approach, both for the procurement and RDTE
data. Then both the mean and median of the absolute value
of this difference were computed. (The mean of positive square
roots of the squared differences could have been used, but
the mean of the absolute differences was considered to be
less sensitive to one or two large values. The median of
the absolute differences was of course even less sensitive
to extreme values.) Tables II.8 and II.9 contain the results
of these computations.
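
The two closeness-of-fit measures just described can be sketched in a few lines. The function name and the four data points below are hypothetical and serve only to show the computation; they are not taken from the procurement or RDTE data. Setting `alpha` to 1 gives the predicted appropriation for (2.1) - (2.3); any other value gives the form used for (2.4) - (2.5).

```python
from statistics import mean, median


def absolute_difference_summary(requests, appropriations, beta, alpha=1.0):
    """Return (mean, median) of |y_t - beta * x_t ** alpha| over all items."""
    predicted = [beta * x ** alpha for x in requests]
    abs_diffs = [abs(y - p) for y, p in zip(appropriations, predicted)]
    return mean(abs_diffs), median(abs_diffs)


# Hypothetical requests and appropriations, in millions of dollars.
requests = [100.0, 200.0, 400.0, 800.0]
appropriations = [98.0, 190.0, 404.0, 700.0]

mean_ad, median_ad = absolute_difference_summary(requests, appropriations, beta=0.97)
print(mean_ad, median_ad)
```

In this made-up example one large cut (the last item) pulls the mean absolute difference well above the median absolute difference, which is the kind of contrast the two measures are meant to expose.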

For the procurement data, all of the approaches yield a
comparable mean absolute difference, with the median as
estimator for equations (2.2) and (2.3) yielding the smallest
values by a narrow margin.* The values for the median of
absolute differences, $|y_t - \hat{\beta} x_t|$, and examination of the stem
and leaf plot for the absolute differences indicate that when the
median was used as estimator for (2.2) and (2.3) most of
the absolute differences were small while an obvious few were
quite large. It is worth noting that the mean absolute
differences are very close for the approach of equation (2.1)
and those of equations (2.2) and (2.3) (with the median as
estimator) but the median absolute differences are considerably
different.

*Since absolute differences are being used, it is to be
expected that the median will fare better than the mean under
the approaches of (2.2) and (2.3).

                                                    Mean Absolute   Median Absolute
Approach                                            Difference      Difference

(2.1)  $y_t = \beta x_t + \varepsilon_t$                   253.2         [illegible]
(2.2)  $y_t = \beta x_t + \varepsilon_t x_t$
         Median as estimator                        [illegible]     [illegible]
         Mean as estimator                            262.8         [illegible]
(2.3)  $y_t = \beta x_t e^{u_t}$
         Median as estimator                          249.7         [illegible]
         Mean as estimator                          [illegible]     [illegible]
(2.4)  $y_t = \beta x_t^{\alpha} e^{u_t}$                  254.9         [illegible]
(2.5)  $y_t = \beta x_t^{\alpha} e^{u_t \ln x_t}$        [illegible]     [illegible]

                         TABLE II.8
                         Procurement
                  (Numbers are in Millions)

                                                    Mean Absolute   Median Absolute
Approach                                            Difference      Difference

(2.1)  $y_t = \beta x_t + \varepsilon_t$                    89.3         [illegible]
(2.2)  $y_t = \beta x_t + \varepsilon_t x_t$
         Median as estimator                           90.3         [illegible]
         Mean as estimator                          [illegible]     [illegible]
(2.3)  $y_t = \beta x_t e^{u_t}$
         Median as estimator                           90.3         [illegible]
         Mean as estimator                          [illegible]     [illegible]
(2.4)  $y_t = \beta x_t^{\alpha} e^{u_t}$                [illegible]     [illegible]
(2.5)  $y_t = \beta x_t^{\alpha} e^{u_t \ln x_t}$        [illegible]     [illegible]

                         TABLE II.9
                            RDTE
                  (Numbers are in Millions)

(The entries of the Median Absolute Difference columns cannot be
assigned to rows; the only clearly legible entry is 71.9.)

Examination of Table II.9 for the RDTE data reveals that
the mean and the median absolute difference are very large for
the approach of equation (2.2) when the mean of the
appropriations/request ratio is used as the estimator for $\beta$.
This is consistent with the discussion in the last section of
the problems with this estimator for the RDTE data. The median
used as an estimator under the approaches of equations (2.2)
and (2.3) and the approach of equation (2.1) all yield
comparable absolute differences. This is to be expected, of
course, since the estimated coefficients under each approach
were nearly identical. Earlier, problems with the application
of least squares to equations (2.4) and (2.5) for the RDTE
data were discussed. The results in Table II.9 are consistent
with that discussion. The mean and median absolute differences
are considerably larger for (2.4) and (2.5) than for equations
(2.1) and for (2.2) - (2.3) (with the median as estimator).

II.5.4. Conclusions


Each approach was used on both the procurement and RDTE
data, and was evaluated by examining (i) the estimated
coefficient or percentage of the request which was appropriated,
(ii) the pattern of the residuals, and (iii) the closeness of
the "predicted" appropriations to actual appropriations. For
each approach the results will be summarized.

(2.1)  $y_t = \beta x_t + \varepsilon_t$


The least squares solution, i.e. estimate of $\beta$ for (2.1),
produced an estimated coefficient for procurement which was
different from those produced by other approaches. A more
complete assessment of such differences will be possible after
confidence intervals are constructed in Chapter IV.

Examination of residuals for this approach reveals rather
disturbing patterns. For example many of the residuals are
positive, which indicates that the classic least squares assumptions
possibly are not being satisfied. Also, the distribution
of data points is such that it appears that least squares
forced the line directly through the low value data points.

The closeness of fit of the predicted appropriations to actual
appropriations for this approach was comparable to that of
approaches (2.3) and (2.2) when the median was used as the
estimator for these approaches, closeness of fit being
measured by the mean $|y_t - \hat{\beta} x_t|$. The median $|y_t - \hat{\beta} x_t|$ was
larger for the approach of (2.1). In summary, by the three
standards used for evaluating the estimates, while the least
squares approach of (2.1) did not produce the worst estimates,
neither did it produce the best. This is especially true for
the procurement data.
(2.4)  $y_t = \beta x_t^{\alpha} e^{u_t}$

This is a "variable percentage" approach where the
percentage of the request that is granted varies with the size
of the request. Estimates of the coefficients were derived by
applying least squares to the equation in the logs. The
estimated coefficients for the procurement data were reasonable;
they implied percentages that went from .91 to 1.14 depending
on the size of the request, with an average of .98. However,
the estimated coefficients for the RDTE data implied percentages
that went from .91 to 1.35 with an average of 1.03. It is
difficult to imagine that for a request of a certain size the
Congress would consistently vote an increase of 35 percent.
This approach yielded larger residuals for requests that were
small in dollar magnitude than any other approach except for
that of (2.5). However, for the RDTE data, the residuals had
a disturbing pattern, being negative for small requests and
positive for large requests. The least squares line appears
to have been unduly affected by some outliers. Insofar as
closeness of fit is concerned, for the procurement data, the
average $|y_t - \hat{\beta} x_t^{\hat{\alpha}}|$ difference when this approach was used was
comparable to that yielded by other approaches. However the
median absolute difference was larger.

For the RDTE data the closeness of fit was not as good
as that for (2.2) and (2.3), when the median was used, nor
as that for (2.1). However it was better than that for (2.2)
and (2.3) when the mean was used as an estimator.
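
The fitting step for the variable percentage approach can be sketched as follows: take logs of both sides, regress the log of appropriations on the log of requests by ordinary least squares, and convert the fitted coefficients into an implied percentage of the request granted. The function names and the sample data are hypothetical; the implied percentage is computed as beta times the request raised to (alpha - 1), which is what dividing the fitted appropriation by the request gives.

```python
import math


def fit_log_log(requests, appropriations):
    """Least squares fit of ln y = ln beta + alpha * ln x; returns (beta, alpha)."""
    lx = [math.log(x) for x in requests]
    ly = [math.log(y) for y in appropriations]
    n = len(lx)
    mean_lx = sum(lx) / n
    mean_ly = sum(ly) / n
    sxx = sum((a - mean_lx) ** 2 for a in lx)
    sxy = sum((a - mean_lx) * (b - mean_ly) for a, b in zip(lx, ly))
    alpha = sxy / sxx
    beta = math.exp(mean_ly - alpha * mean_lx)
    return beta, alpha


def implied_percentage(beta, alpha, request):
    """Fitted appropriation divided by the request: beta * x ** (alpha - 1)."""
    return beta * request ** (alpha - 1.0)


# Hypothetical requests (millions) lying exactly on y = 1.5 * x ** 0.95.
requests = [50.0, 100.0, 500.0, 1000.0, 3000.0]
appropriations = [1.5 * x ** 0.95 for x in requests]
beta_hat, alpha_hat = fit_log_log(requests, appropriations)
print(beta_hat, alpha_hat)
print(implied_percentage(beta_hat, alpha_hat, 50.0),
      implied_percentage(beta_hat, alpha_hat, 3000.0))
```

With an exponent below one the implied percentage falls as the request grows, which is why a single large positive residual at a very small request can tilt the whole fitted curve.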


(2.2)  $y_t = \beta x_t + \varepsilon_t x_t$

For this approach first the median and then the mean of
the ratio $y_t/x_t$ were used as estimators of $\beta$. For the
procurement data the estimates yielded by the mean and the median
were comparable. However when the mean was used on the RDTE
data the estimated $\beta$ was 1.09. Examination of the data showed
that this estimate, a least squares estimate, was also unduly
affected by an outlier. For the procurement data, the
residuals' pattern was reasonable and similar for both mean
and median as estimator, separating out some early 1950's
items and pointing to a possible change in Congressional
attitude after 1969, since many post-1969 items had negative
residuals. For the RDTE data the least squares estimator,
the mean, revealed almost all negative residuals with a few
large positive ones. The residuals when the median was used
as an estimator appeared reasonably distributed. For example
there were as many positive as negative residuals. Insofar
as closeness of fit is concerned, for the procurement data,
the median showed the smallest mean (and median) absolute
error, $|y_t - \hat{\beta} x_t|$, of all the approaches, except for the
median using the approach of (2.3). The mean also showed a
small absolute error. However, for the RDTE data, while the
median again showed small mean (and median) absolute errors
the mean showed the largest. As mentioned earlier it appears
that a few outliers seriously affected the estimates when the
mean was used for the approaches of (2.2) and (2.3).
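
The sensitivity of the mean of the ratio to a single item can be shown with a small sketch. The function name is hypothetical, and the first four request and appropriation pairs are invented for illustration; only the last pair (a request of $61 million and appropriations of $419 million) corresponds to the 1955 Navy RDTE item cited in the text.

```python
from statistics import mean, median


def ratio_estimates(requests, appropriations):
    """Return (mean, median) of the ratios y_t / x_t, the two estimators of beta."""
    ratios = [y / x for x, y in zip(requests, appropriations)]
    return mean(ratios), median(ratios)


# Millions of dollars. The first four items are hypothetical; the last pair
# uses the 1955 Navy RDTE figures quoted in the text.
requests = [500.0, 800.0, 1200.0, 2000.0, 61.0]
appropriations = [490.0, 800.0, 1176.0, 1960.0, 419.0]

mean_beta, median_beta = ratio_estimates(requests, appropriations)
print(mean_beta, median_beta)
```

The one extreme ratio moves the mean far above every other item's ratio while leaving the median where the bulk of the items lie.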


(2.5)  $y_t = \beta x_t^{\alpha} e^{u_t \ln x_t}$

This approach is very similar to that of (2.4). Coefficients
were estimated by taking the logs of both sides,
dividing the equation by $\log x_t$ and applying least squares.
The results are comparable to those for the approach of (2.4).
Coefficients imply reasonable percentages for procurement data
but not for RDTE data. Residuals show an undesirable pattern,
from the standpoint of least squares, for RDTE data. The
closeness of fit is not particularly impressive, especially
for RDTE data.


(2.3)  $y_t = \beta x_t e^{u_t}$

The coefficient was estimated by using the median and the mean
of $\ln y_t - \ln x_t$ as alternative estimates for $\ln \beta$. The estimated
coefficients are comparable to those of the approach of (2.2)
for both procurement and RDTE data. In other words, both the
median and mean yield reasonable estimates for the procurement
data but the mean estimate for the RDTE data is affected by
outlier values, although not as much as the mean for the
approach of (2.2).* Closeness of fit results are also
comparable to those of (2.2).

*$\hat{\beta}$ when using the mean as estimator for (2.2) is 1.09, while
$\hat{\beta}$ when using the mean as estimator for (2.3) is 1.03.

II.6. Summary


This chapter represents an effort to analyze historical
data under the general working hypothesis that Congressional
action on budget requests usually results in percentage
adjustments at the appropriations level of aggregation. In
the first section the problem of modelling appropriations
as a percentage of requests was discussed. The second section
was devoted to methodology. The prior work of Davis, Dempster
and Wildavsky and of Stromberg was reviewed and problems with
their approaches were pointed out. The methodology used in
this study was then presented. Alternative formulations of,
or approaches to, the percentage adjustment hypothesis were
analyzed by examining estimated coefficients, residuals, and
closeness of fit of the "predicted appropriations" to the
actual appropriations. Each approach produced somewhat
different results. The most consistent results, on the basis
of various criteria discussed in section II.5, were produced by
the following approaches when the median (of the appropriations/
request ratio or the logarithm of the ratio) was used as the
estimator for $\beta$ in:

(2.2)  $y_t = \beta x_t + \varepsilon_t x_t$

(2.3)  $y_t = \beta x_t e^{u_t}$

The other approaches, which all involve least squares
estimators in one form or another, produced at one time or 
another results which were either unreasonable or inferior 
in some way to the results of the approaches (2.2) and (2.3) 
when medians were used as estimators. 

The analysis of the residuals combined with an examination 
of scatterplots for the approaches of (2.2) and (2.3) (and to
a certain extent for the other approaches, for that matter)
produced three very interesting substantive results. First,
the 1954 data, coming as it did at the end of the Korean 
conflict, may not be part of the same population as the 
other data. Second, the post-1969 items, especially for 
procurement, show a definite pattern of negative residuals 
which indicates that the percentage adjustment very likely 
changed in 1969. Finally, in the RDTE data, the 1957-1969 
Air Force RDTE items dominate the positive residuals which 
indicates that decision-making on Air Force items may have 
been different from decision-making on Army and Navy items 
during that period. 

The chapters which follow will be devoted to analysis of
the same set of data using an alternative estimation
procedure. The results of using this alternative procedure
will be compared with the least squares results.

APPENDIX A


Stem and leaf plots are a method of quickly summarizing
a large set of numbers. They are discussed extensively by
John Tukey in his forthcoming book, Exploratory Data Analysis
[Tukey]. The basic technique is best described by means of
an example. Suppose one wants to summarize the following set
of numbers which might be percentages of requests appropriated
using (2.4) for various sizes of the request:


.981 965 921 sie eo oe .985 

995 O58 .943 .946 Ss: .986 

.990 943 .954 .988 .964 .981 

.982 .986 7 99> .985 .966 942 

.976 5512) .968 mee) (i Od .990 

m9) 997 .994 coe ao Ou: sole 
TABLE A.1

A stem and leaf plot of these numbers with the stem being
the first two digits and the leaves being the third digit is
given in Figure A.1.

[Stem and leaf plot of the values in Table A.1]

FIGURE A.1

The plot reveals that the distribution is somewhat skewed
left and also emphasizes a potential outlier, .921.
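
The construction described above can be sketched in a few lines of code. The function name is hypothetical, and the eight values used are illustrative three-digit percentages of the kind shown in Table A.1, not a transcription of that table. The stem is the first two digits and the leaf is the third digit, as in the text; empty stems are kept so that gaps in the distribution remain visible.

```python
from collections import defaultdict


def stem_and_leaf(values, decimals=3):
    """Return stem and leaf lines for values in [0, 1) recorded to `decimals`
    places: the stem is every digit but the last, the leaf is the last digit."""
    groups = defaultdict(list)
    for v in values:
        scaled = round(v * 10 ** decimals)  # e.g. 0.981 -> 981
        stem, leaf = divmod(scaled, 10)     # 981 -> stem 98, leaf 1
        groups[stem].append(leaf)
    lines = []
    for stem in range(min(groups), max(groups) + 1):
        leaves = "".join(str(d) for d in sorted(groups.get(stem, [])))
        lines.append(f"{stem} | {leaves}".rstrip())
    return lines


# Illustrative values only.
sample = [0.921, 0.943, 0.946, 0.954, 0.981, 0.985, 0.986, 0.990]
for line in stem_and_leaf(sample):
    print(line)
```

In the printed plot the isolated leaf on the lowest stem stands apart from the rest, which is how a potential outlier such as .921 shows up.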
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APPENDIX B 


This appendix contains scatterplots and stem and leaf
plots for residuals for each of the equations which were
subjected to analysis in this chapter. It should be noted
that in this appendix the term "residuals" refers to the
following:

(2.1')  $y_t - \hat{\beta} x_t$
(2.2')  $\frac{y_t}{x_t} - \hat{\beta}$
(2.3')  $\ln y_t - \ln x_t - \ln \hat{\beta}$
(2.4')  $\ln y_t - \ln \hat{\beta} - \hat{\alpha} \ln x_t$
(2.5')  $\frac{\ln y_t}{\ln x_t} - \frac{\ln \hat{\beta}}{\ln x_t} - \hat{\alpha}$

Since in many cases the plots are not identified explicitly
on the page on which they appear, the following list identifies
each plot and gives its page number. For the $\ln x_t$ versus
$\ln y_t$ scatterplot for (2.4) and the $1/\ln x_t$ versus
$\ln y_t / \ln x_t$ scatterplot for (2.5) the least squares lines
are included. These are indicated by the symbol "YY". The pages
containing the stem and leaf plots of the residuals contain a
complete listing of the residuals in addition to various measures
such as mean, median and so forth.

                                                          Procurement
                                                             Page

(2.1)  $y_t = \beta x_t + \varepsilon_t$
    Scatterplot* x versus y
    Stem and leaf of residuals
    Scatterplot x versus residuals

(2.2)  $y_t = \beta x_t + \varepsilon_t x_t$  (Median of ratio as estimate)
    Stem and leaf of residuals                               B-6
    Scatterplot x versus residuals                           B-7

(2.2)  $y_t = \beta x_t + \varepsilon_t x_t$  (Mean of ratio as estimate)
    Stem and leaf of residuals                               B-8
    Scatterplot x versus residuals                           B-9

(2.3)  $y_t = \beta x_t e^{u_t}$  (Median of ln ratio as estimate)
    Stem and leaf of residuals
    Scatterplot ln x versus residuals                        B-11

(2.3)  $y_t = \beta x_t e^{u_t}$  (Mean of ln ratio as estimate)
    Stem and leaf of residuals
    Scatterplot ln x versus residuals

(2.4)  $y_t = \beta x_t^{\alpha} e^{u_t}$
    Scatterplot ln x versus ln y                             B-14
    Stem and leaf of residuals                               B-15
    Scatterplot of ln y versus residuals                     B-16
    Scatterplot of ln x versus residuals                     B-17

(2.5)  $y_t = \beta x_t^{\alpha} e^{u_t \ln x_t}$
    Scatterplot $1/\ln x_t$ versus $\ln y_t / \ln x_t$
    Stem and leaf of residuals
    Scatterplot of $(\ln y_t / \ln x_t)$ "predicted" versus residuals
    Scatterplot of [illegible] versus [illegible]
    Scatterplot of $\ln x_t$ versus residuals

*The small circles on the scatterplots indicate that more
than one observation has been plotted in approximately the
same place.

APPENDIX C


This appendix contains summaries (stem and leaf plots, means,
medians and so forth) for so-called "predicted" percentages
of requests which would be appropriated when the requests are
of the size of the data analyzed in the chapter. These
predicted percentages are obtained by computing $\hat{\beta} x_t^{\hat{\alpha} - 1}$,
where $\beta$ and $\alpha$ were estimated using (2.4) and (2.5). The
following list states which values for $\hat{\beta}$ and $\hat{\alpha}$ were used and
identifies the pages on which the summaries can be found:

Equation Number    $\hat{\beta}$         $\hat{\alpha}$    $x_t$ data set    Page
(2.4)              1.473         .948    Procurement       C-2
(2.4)              [illegible]   .903    RDTE              C-3
(2.5)              1.242         .969    Procurement       C-4
(2.5)              [illegible]   .852    RDTE              C-5

CHAPTER III

Introduction


In the previous chapters, the institutional background
of our analysis of Congressional defense budget activity was
discussed, alternative approaches to models for characterizing
this activity were presented, coefficients for characteristic
equations were estimated and residuals analyzed. For example,
it was pointed out that if the relationship between request
in year t, $x_t$, and appropriation, $y_t$, is modeled as

(2.2)  $y_t = \beta x_t + \varepsilon_t x_t$

then the median of the ratio $y_t/x_t$ is a useful estimator for
$\beta$; that is, it is superior under certain circumstances to a
least squares estimate because of the median's de-emphasis
of extreme values.

Following the work of Andrews, Tukey, and others [Andrews,
et al., 1972], this chapter explores the usefulness of an
alternative robust estimator for the five different approaches
discussed in the previous chapter. The estimator is the
Huber "M" estimator [Huber, 1973]. This estimator was
originally proposed for use in the estimation of a location
parameter [Huber, 1964]. However, it has recently been extended
to the regression problem, and asymptotic properties have been
derived for that case [Huber, 1973]. The format of this chapter
will be the following: first, the basic problem of concern in robust
regression will be discussed, with examples taken from the
analysis of the previous chapter; next, the technique and
rationale of Huber "M" estimates will be outlined; estimates
for defense budget data for each of our five basic equations
will be presented, discussed and compared with non-robust,
least squares estimates. Finally, residuals and outliers
will be analyzed.

III.1. The Problem


It may be useful at this point to restate the five basic
equations that have been used in the analysis of defense budget
data in the previous chapter:

(2.1)  $y_t = \beta x_t + \varepsilon_t$
(2.2)  $y_t = \beta x_t + \varepsilon_t x_t$
(2.3)  $y_t = \beta x_t e^{u_t}$
(2.4)  $y_t = \beta x_t^{\alpha} e^{u_t}$
(2.5)  $y_t = \beta x_t^{\alpha} e^{u_t \ln x_t}$

where $y_t$ stands for appropriations in year t, $x_t$ stands for the
agency budget request in year t, and $\{\varepsilon_t\}$ and $\{u_t\}$ are stochastic
disturbances which are assumed to be independent and
identically distributed.

In the previous chapter, results (especially for the
RDTE data) indicated that for (2.2) and (2.3) the median
$y_t/x_t$ ratio and $\ln(y_t/x_t)$ ratio were more useful estimators
for $\beta$ and $\ln \beta$ than were the least squares estimators. This
is because (i) for both (2.2) and (2.3), the mean gave
coefficient estimates which were basically suspect: they
indicated that the percentage of the request appropriated
is greater than 1.0, a result which even a casual glance at
plotted data did not support, and (ii) stem and leaf plots
of the residuals and scatterplots of $y_t$ versus $x_t$ and $\ln y_t$
versus $\ln x_t$ showed that a few extreme observations were
having a disproportionate impact on the estimates. The
robustness of the median as opposed to the mean is not
particularly surprising since (2.2) and (2.3) reduce (when
appropriate transformations are made) to a location parameter
problem of the form:

    $y_t = \theta + \varepsilon_t$

where $\varepsilon_t$ is a random variable with a given distribution and
$\theta$ is a fixed constant.

Andrews, Tukey et al. found the median to be more robust
than the mean for the location parameter problem [Andrews,
et al., 1972]. Also, see [Mosteller, 1973, pp. 248-250].


The problem with the mean is that it is particularly
susceptible to extreme observations or outliers. For
example suppose $\varepsilon$ is drawn from a distribution of the form

    $f(\varepsilon) = p\,N(0,1) + q\,N(0,\sigma^2)$

where p + q = 1. If one compares the variance of the mean, $\bar{y}$,
as an estimator for $\theta$, to the variance of $\tilde{y}$, the median,
also an estimator of $\theta$, one finds that as $\sigma^2$ gets large,
greater than 9 in the case of p = .9 and q = .1, then the
variance of $\tilde{y}$ becomes smaller than the variance of $\bar{y}$.* Thus
for this model of random error the median is more efficient
than the mean.
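
The comparison can be checked numerically under stated assumptions. The sketch below takes the error density to be a mixture of a N(0,1) and a N(0, sigma squared) component with weights p and 1 - p, and it uses the standard large-sample approximation for the variance of the sample median, 1/(4 n f(0)^2); both variances are reported per observation (that is, multiplied by n). The function names are hypothetical.

```python
import math


def mean_variance(p, sigma2):
    """n * Var(sample mean) for the mixture p*N(0,1) + (1-p)*N(0,sigma2)."""
    return p + (1.0 - p) * sigma2


def median_variance(p, sigma2):
    """Large-sample n * Var(sample median): 1 / (4 * f(0)**2), where f(0) is
    the mixture density evaluated at its center."""
    density_at_zero = (p + (1.0 - p) / math.sqrt(sigma2)) / math.sqrt(2.0 * math.pi)
    return 1.0 / (4.0 * density_at_zero ** 2)


for sigma2 in (4.0, 9.0, 10.0, 16.0):
    print(sigma2, mean_variance(0.9, sigma2), median_variance(0.9, sigma2))
```

With p = .9 the two variances are nearly equal at a contaminating variance of 9, and the median has the smaller variance for larger values. Setting p = 1 (no contamination) gives a mean-to-median variance ratio of 2/pi, the usual efficiency of the median under normal errors.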

Regression problems, e.g. analysis of data in terms of
relations like (2.1), (2.4) and (2.5), also involve outlier
difficulties. However the analogue to the median in regression
problems is not as easy to calculate.** Also, even in the
location parameter case, although the median is not as
susceptible to "wild shot" or extreme observations as the
mean, it is only 64 percent efficient if the distribution of
the error component is normal. What is desired for each of
the equations (2.1) - (2.5) is an estimator with robustness
properties similar to the median but one which is relatively
efficient under normality assumptions.

*This is not a contradiction of the Gauss-Markov theorem
since the median is not linear in the observations. For a
more detailed discussion of this so-called "wild shot"
problem, see Appendix I.

**The estimator is that which results from minimizing
$\sum_t |y_t - \hat{y}_t|$. This minimization involves solving a linear
programming problem.

III.2. The Technique and Rationale of Huber "M" Estimates


Huber [1964] proposed a robust alternative to the mean
and the median in the location parameter problem. The
estimate is obtained by solving the equation

(3.1)  $\sum_{t=1}^{T} \phi(y_t - \hat{\theta}) = 0$

for $\hat{\theta}$,

(3.2)  where $\phi(z) = z$    for $|z| < c$
                    $= c$    for $z \geq c$
                    $= -c$   for $z \leq -c$

and c is a constant.


This estimate is not linear in the observations unless c is
very large, in which case it reduces to the least squares
estimate. Huber [1964] derived the asymptotic properties
of the estimator as $T \to \infty$, and in tests run by Andrews et al.
[1972] it performed well in terms of robustness compared to
the sample mean. It was more efficient than the sample mean
in the case of long-tailed error distributions and almost as
efficient when the error distribution was normal. The reason
for the term "M" estimate is that, in general, maximum
likelihood estimates are the result of solving an equation
like (3.1), where $\phi(z) = -f'(z)/f(z)$ and f(z) is a density function.
For example, for $\phi(z) = z$ ($c = \infty$) the solution to (3.1) will
yield a maximum likelihood estimate where $\varepsilon$ is N(0,1). The
specification of $\phi(z)$ in (3.2) corresponds to a distribution
with fatter tails than the normal distribution. In other
words, between -c and +c the assumed distribution is normal
except for multiplication by a constant; however, beyond ±c
the assumed distribution resembles an exponential, with tails
fatter than the normal.* An error model of this type means
that a "wild shot" observation is more likely than in the
strict normal distribution case; consequently it is not
given as much weight in the calculation of the estimated
location parameter as in least squares, which is maximum
likelihood if the distribution is normal.
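
A minimal sketch of the location estimate defined by (3.1) and (3.2) is given below. It is not the algorithm used in this study: it assumes the scale of the errors is known and equal to one, and it solves the estimating equation by bisection, which works because the left side of (3.1) never increases as the trial value of the location parameter increases. The function names and the six data values are hypothetical.

```python
def huber_phi(z, c):
    """The function in (3.2): z inside [-c, c], clipped to -c or c outside."""
    return max(-c, min(c, z))


def huber_location(ys, c, tol=1e-10):
    """Solve sum_t phi(y_t - theta) = 0 for theta by bisection (unit scale)."""
    lo, hi = min(ys), max(ys)  # the sum is >= 0 at lo and <= 0 at hi
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if sum(huber_phi(y - mid, c) for y in ys) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Hypothetical ratios of appropriations to requests, with one "wild shot".
ratios = [0.95, 0.97, 0.99, 1.01, 1.03, 6.87]
print(huber_location(ratios, c=0.1))    # close to the bulk of the data
print(huber_location(ratios, c=100.0))  # very large c: the sample mean
```

With a small c the wild shot contributes only the constant c to the sum, so the estimate stays near the other five values; with a very large c no observation is clipped and the estimate coincides with the sample mean.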

Other functional forms for $\phi$ have been proposed by
Andrews, Hampel and Tukey [Andrews, et al., 1972] which allow
the function $\phi$ to go to zero as |z| becomes large. For
example the $\phi$ suggested by Hampel is

    $\phi(z) = \sin(z/c)$    for $|z| \leq \pi c$
           $= 0$           for $|z| > \pi c$

while the $\phi$ suggested by Tukey is

    $\phi(z) =$ [illegible]   for $|z| < c$
           $= 0$           for $|z| \geq c$

Note that when a normal distribution is assumed and the
corresponding $\phi$ used, each observation has an effect directly
proportional to $(y_t - \hat{\theta})$ on the maximum likelihood estimator.
For Huber's estimator each observation has an effect
proportional to $(y_t - \hat{\theta})$ for values of $|y_t - \hat{\theta}| < c$ and an
effect proportional to c for values of $|y_t - \hat{\theta}| > c$. For
Andrews, Hampel and Tukey estimators, values of $|y_t - \hat{\theta}|$ which
are greater than c have no effect on the estimate. The choice
of c for these estimators involves a trade off between
Gaussian efficiency (if f is really normal) and robustness
(if the true distribution is really a longer tailed distribution
than the normal).* The Andrews, Hampel and Tukey
estimators will not be discussed further in this chapter.
Rather, attention will now focus on the application of the
Huber "M" estimate to the regression problem. In particular,
the application will be to the DOD budget data previously
discussed.

*See Appendix B for a derivation of the distribution
corresponding to the $\phi$ proposed by Huber.

Huber [1973] proposed an analogue to the M estimator for
a location parameter as a robust estimator of regression
coefficients: for a regression problem of the form

    $y_t = \sum_{j=1}^{p} \beta_j x_{jt} + \varepsilon_t, \quad t = 1, \ldots, T$

the proposal was to find $\hat{\beta} = (\hat{\beta}_1, \ldots, \hat{\beta}_j, \ldots, \hat{\beta}_p)$ that is the
solution to the equations

(3.3)  $\sum_{t=1}^{T} x_{it}\, \phi\left(y_t - \sum_{j=1}^{p} \hat{\beta}_j x_{jt}\right) = 0, \quad i = 1, \ldots, p$

(3.4)  where $\phi(z) = z$    for $|z| < c$
                    $= c$    for $z \geq c$
                    $= -c$   for $z \leq -c$

*See Andrews [1973] for a short discussion.

The estimates will not in general be linear in the observa- 
tions. Note that for very large ¢c, the estimator becomes, 

in effect, the least squares (or maximum likelihood if normal 
residuals are assumed) estimator. However for smaller c, the 
estimator reduces the impact of extreme observations. 

Huber regression estimates can also be viewed as the result 


meesolving the following minimization problem: 


Al; pos 
(3.5) moe > ply. - %2 8.x.) 
t-1 © 4=7 77% 
re 
where (2) = kz for |z] < ¢ 
=ez-ke for z ane 


2 
=-cz-‘e forza< -c 


Note as ec gets large the minimization in (3.5) becomes 


simply a least squares problem. 
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Huber [1973] derives the asymptotic properties for 
estimators yielded by solving the system (3.3) under quite 
general conditions for defining ¢. In other words, his 
asymptotic results are not confined to a @ of the form of 
tase Hus 

There are two computational difficulties with Huber
estimates. First, they are not one step estimates. In order
to define the function \phi, some value for \beta is needed. Once
the function is defined a new value for \beta can be found, and
so forth. Huber [1973] has suggested an algorithm for solving
the equations.* It is not immediately clear that when the
algorithm stops a global minimum to (3.5) or a global maximum
to the likelihood function has been achieved. In fact the
optimization properties are not discussed by Huber.
Fortunately, the algorithm is relatively easy to implement
and, given some rather weak conditions on \phi (and consequently
on \rho), the stopping conditions specified by Huber are necessary
and sufficient for a global minimum to (3.5). This is discussed in
Appendix C to this chapter.

*This problem and the algorithm are discussed in
Appendix C.

The second problem is that usually in a regression problem,
it is not desirable to assume that \epsilon_t is distributed like a
N(0,1) random variable between +c and -c. Rather, the standard
assumption is that \epsilon_t is distributed N(0,\sigma^2) between
+c\sigma and -c\sigma. This complicates the problem since \sigma, a scale
parameter, must be estimated. Thus one additional equation which
involves \hat\beta must be solved and the system (3.3) is modified by
replacing

        \phi\Bigl( y_t - \sum_{i=1}^{p} \hat\beta_i x_{it} \Bigr)   with   \phi\Bigl[ \Bigl( y_t - \sum_{i=1}^{p} \hat\beta_i x_{it} \Bigr) / \hat\sigma \Bigr] .*
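
The scaled system can be illustrated for the one-coefficient model
y_t = \beta x_t + \epsilon_t. The following Python sketch rests on several
assumptions and is not the algorithm of Huber [1973] discussed in
Appendix C: it uses iteratively reweighted least squares, it re-estimates
\sigma at each pass as the median absolute residual divided by 0.6745, and
it is run on made-up data in which the last observation is a wild shot.
At a fixed point the weighted normal equation is equivalent to
\sum x_t \phi[(y_t - \hat\beta x_t)/\hat\sigma] = 0.

```python
import statistics


def huber_slope(x, y, c=1.0, tol=1e-10, max_iter=500):
    """Huber-type estimate of beta in y_t = beta * x_t + e_t.

    Assumed scheme (not from the text): iteratively reweighted least
    squares with sigma = median(|residual|) / 0.6745 recomputed each pass.
    """
    sxx = sum(xi * xi for xi in x)
    beta = sum(xi * yi for xi, yi in zip(x, y)) / sxx  # least squares start
    for _ in range(max_iter):
        resid = [yi - beta * xi for xi, yi in zip(x, y)]
        sigma = statistics.median([abs(r) for r in resid]) / 0.6745
        if sigma == 0.0:
            break  # at least half the residuals are exactly zero
        k = c * sigma
        # Weight 1 inside [-k, k]; k/|r| outside, so that w * r = k * sign(r).
        w = [1.0 if abs(r) <= k else k / abs(r) for r in resid]
        new_beta = (sum(wi * xi * yi for wi, xi, yi in zip(w, x, y))
                    / sum(wi * xi * xi for wi, xi in zip(w, x)))
        converged = abs(new_beta - beta) < tol
        beta = new_beta
        if converged:
            break
    return beta


# Made-up data: y is close to 0.97 * x except for one wild observation.
x = [float(t) for t in range(1, 11)]
noise = [0.05, -0.04, 0.03, -0.02, 0.01, -0.01, 0.02, -0.03, 0.04, 0.0]
y = [0.97 * xi + ni for xi, ni in zip(x, noise)]
y[-1] += 8.0

beta_ls = sum(xi * yi for xi, yi in zip(x, y)) / sum(xi * xi for xi in x)
beta_huber = huber_slope(x, y, c=1.0)

if __name__ == "__main__":
    print("least squares:", round(beta_ls, 4))
    print("Huber, c=1.0 :", round(beta_huber, 4))
```

On these made-up data the single wild observation pulls the least squares
slope well above 0.97, while the reweighted estimate stays close to it.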


III.3. Comparison of Huber M Estimates and the
Estimates of Chapter II

Huber "M" estimates were calculated for the procurement
and RDTE data for equations (2.1) through (2.5). Estimates
were calculated for c values of 2.0, 1.5, and 1.0. Only
the results for c values of 2.0 and 1.0 are shown. The
values for c = 1.5 lie between these. Tables III.1 and
III.2 present the results and the corresponding results
from Chapter II.

*See [Huber, 1972; Andrews et al., 1972; Huber, 1973;
and Huber, 1964] for discussions of the estimation of this
scale factor.
                                                       c=1.0              c=2.0           Least squares
                                                 \hat\beta \hat\alpha  \hat\beta \hat\alpha  \hat\beta \hat\alpha
(2.1) y_t = \beta x_t + \epsilon_t                    .961     NA        .962     NA        .969     NA
(2.2) y_t = \beta x_t + \epsilon_t x_t                .969     NA        .974     NA        .993     NA
                                                                                            .974 (median)
(2.3) y_t = \beta x_t e^{u_t}                         .963     NA        .971     NA        .977     NA
                                                                                            .973 (median)
(2.4) y_t = \beta x_t^{\alpha} e^{u_t}               1.112    .982      1.181    .975      1.473    [illegible]
(2.5) y_t = \beta x_t^{\alpha} e^{u_t \ln x_t}        .997    .996      1.041    .991      1.242    .969

                    TABLE III.1 (NA = not applicable)

                        Procurement Estimates


Estimates for \beta in (2.3) through (2.5) are derived by taking
exp(ln \hat\beta). For (2.3) through (2.5) actually ln \beta was estimated.
The estimates for ln \beta are in Table III.1.1.


              c=1.0          c=2.0          Least squares
(2.3)         [illegible]    -.026          [illegible]
                                            -.027 (median)
(2.4)         [illegible]     .169           .384
(2.5)         [illegible]     .038           .215

                    TABLE III.1.1

          Estimated ln \beta for Procurement Data
                                                       c=1.0              c=2.0           Least squares
                                                 \hat\beta \hat\alpha  \hat\beta \hat\alpha  \hat\beta \hat\alpha
(2.1) y_t = \beta x_t + \epsilon_t                    .977     NA        .978     NA        .982     NA
(2.2) y_t = \beta x_t + \epsilon_t x_t                .989     NA        .990     NA       1.093     NA
                                                                                            .989 (median)
(2.3) y_t = \beta x_t e^{u_t}                         .989     NA        .989     NA       1.025     NA
                                                                                            .989 (median)
(2.4) y_t = \beta x_t^{\alpha} e^{u_t}               1.062    .990      1.076    .988      2.028    .903
(2.5) y_t = \beta x_t^{\alpha} e^{u_t \ln x_t}       1.029    .994      1.005    .998      [illegible] [illegible]

                    TABLE III.2 (NA = not applicable)

                            RDTE Estimates


Again, for (2.3) through (2.5) it was ln \beta, not \beta, that was
estimated. The estimates for ln \beta are in Table III.2.1.


              c=1.0          c=2.0          Least squares
(2.3)         -.012          -.011           .024
                                            -.011 (median)
(2.4)          .059          [illegible]    [illegible]
(2.5)         [illegible]     .005          [illegible]

                    TABLE III.2.1

            Estimated ln \beta for RDTE Data


Some of the most interesting results are those for
equations (2.4) and (2.5) both for procurement and RDTE data.
Before addressing those results, salient features of the
results for (2.1) through (2.3) will be pointed out. First,
for the procurement data all of the techniques estimated
coefficients (estimated percentages of the budget request
that are granted) which were equivalent. The only
exception to this statement is for equation (2.2) for which
the least squares estimate was .993 while the median y_t/x_t
estimate and the Huber (c=2.0) estimates were .974, and the
Huber (c=1.0) estimate was .969. In other words, the least
squares estimate implies that over 99% of the budget request
is appropriated, while the others say about 97% of the request
is appropriated. It is difficult to assess these
differences without some measure of stability or variation
of the estimates. This topic will be taken up in Chapter IV.
Also of interest is the fact that for the procurement data
the results were similar when c=1.0 and c=2.0.

The RDTE results for (2.1) through (2.3) are of even more
interest. Recall that in the previous chapter it was observed
that certain data points appeared to be affecting the least
squares estimates for (2.2) and (2.3). For example, the least
squares estimator for (2.2) implied that 109% of the budget
request is appropriated. Using the median y_t/x_t value as an
estimator in model (2.2), and the median ln(y_t/x_t) value as
an estimator in (2.3) produced results which appeared more
reasonable than the least squares results. The application
of the Huber "M" estimator to (2.2) and (2.3) yielded results
that were almost identical to those achieved by the use of
the median. The result was not particularly surprising
since, as mentioned in section III.1 of this chapter, estima-
tion for (2.2) and (2.3) was really a location parameter
problem. Andrews, Tukey and others have already noted that
both the median and the Huber "M" are useful estimators in
location parameter problems when the error distribution is
non-normal. Huber "M" is more efficient when near-normality
of errors holds. It is interesting to note that the extra
computational effort involved in computing the Huber "M"
in (2.2) and (2.3) did not yield results different from
the "easy to compute" median. As with the procurement data,
the importance of small differences such as that between the
least squares estimate and the Huber "M" estimates for (2.1)
using RDTE data must await the analysis of the stability of
the estimates in the following chapter and analysis of
residuals in the next section.

In the case of both the procurement and RDTE data, but
especially for the RDTE data, the results of applying
least squares and the Huber technique to (2.4) and (2.5)
appear to be different. For example, for the RDTE data, in
(2.5) \hat\beta is almost three times as large when least squares is
used as when Huber "M" estimates are computed. Again the full
impact of the differences must await (i) an analysis of the
stability of the coefficients in the next chapter and (ii) examina-
tion of the residuals. However, it is possible to sharpen
our intuition about the differences between the results for
(2.4) and (2.5) by viewing them in another way.

Since the percentage of a request granted for a request of
size x_t is modelled in (2.4) and (2.5) as being equal to
\beta x_t^{\alpha - 1}, the actual requests for the procurement and RDTE
data were taken and \hat\beta x_t^{\hat\alpha - 1} was computed (i) when
ln \beta and \alpha were estimated via least squares and (ii) when they
were estimated using Huber's method. Thus, for the same x_t values
it was possible to compare the percentages estimated to have been
approved when least squares was used and the percentages estimated
to have been approved when the Huber "M" estimate was computed.
Tables III.3 and III.4 contain stem and leaf plots of the percentages.

Before discussing the tables it should be noted that since
\hat\alpha < 1.0 under all estimation techniques, each of the techniques
estimates that as x_t, the request, increases the percentage of
the request granted will decrease.
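
The computation just described can be sketched as follows. The
coefficient pairs are the RDTE estimates for (2.4) reported in Table
III.2 (Huber with c=1.0, and least squares); the request sizes are
hypothetical values chosen only for illustration and are assumed to be
in the same units as the data, and the function name is likewise an
assumption.

```python
def fraction_granted(x, beta, alpha):
    """Estimated fraction of a request of size x granted: beta * x**(alpha - 1)."""
    return beta * x ** (alpha - 1.0)


# RDTE estimates for (2.4) as reported in Table III.2: (beta-hat, alpha-hat).
HUBER_C1 = (1.062, 0.990)
LEAST_SQUARES = (2.028, 0.903)

# Hypothetical request sizes (not the actual budget requests).
requests = [50.0, 500.0, 5000.0]

if __name__ == "__main__":
    for x in requests:
        print(x,
              round(fraction_granted(x, *HUBER_C1), 3),
              round(fraction_granted(x, *LEAST_SQUARES), 3))
```

Since both values of \hat\alpha are below 1.0, both columns fall as the
request grows, but the least squares pair falls much faster and exceeds
1.0 by a wide margin for small requests.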

Examination of Table III.3 highlights the fact that the
Huber "M" estimates the percentage of the request granted to be
smaller than does least squares. For (2.4), the estimated \beta
and \alpha under least squares imply that for 17 of the 80 x_t values
tried it is estimated that Congress would be appropriating greater
than 100% of the request. On the other hand, when Huber "M"
estimates are used (c=2.0) it is estimated that for only one
of the 80 x_t values tried would Congress be appropriating
greater than 100% of the request (99% for c=1.0). For the 80
x_t values tried, the estimated percentage of the request
granted would be for the most part between .96 and .99. For
(2.5) similar differences are noticeable. When the Huber
"M" approach is used, these percentages are mostly between
.96 and .98 (c=2.0) or .96 and .97 (c=1.0). The mean
estimated percentages of the requests that would be granted
for the eighty x_t values tried are given in Table III.3.1.


                                                  c=1.0     c=2.0     Least squares
(2.4) y_t = \beta x_t^{\alpha} e^{u_t}                 .968      .972      .981
(2.5) y_t = \beta x_t^{\alpha} e^{u_t \ln x_t}         .969      .971      .975

                    TABLE III.3.1

The differences between the estimated percentages of the
request that will be granted when least squares as opposed
to Huber "M" estimates for \beta and \alpha are used are even more
noticeable in Table III.4 for the RDTE data.

The results in Table III.4 probably reflect better than
any other information presented so far the implications of
the differences between the Huber "M" estimates for \beta and \alpha
for (2.4) and (2.5) and the least squares estimates. For
equation (2.4), the least squares estimates for \beta and \alpha imply
that for a low enough request the estimated percentage of the
request appropriated is greater than 1.10. The Huber estimates
for \beta and \alpha in (2.4) imply a maximum percentage of 1.025 for the
x_t values tried. The differences are more marked for (2.5).
The least squares estimates for \beta and \alpha imply that, for three
of the x_t values, as much as 150% of the request was granted.
The largest value, for the x_t values tried, when the Huber "M"
approach was used was 100.5%. Table III.4.1 contains the
mean estimated percentage of the request granted, for the
63 RDTE x_t values tried.


                                                  c=1.0         c=2.0     Least squares
(2.4) y_t = \beta x_t^{\alpha} e^{u_t}                 [illegible]   .991      1.029
(2.5) y_t = \beta x_t^{\alpha} e^{u_t \ln x_t}         .989          .998      [illegible]

                    TABLE III.4.1

When trying to assess the reasons for the differences
between the least squares and Huber "M" results which are
reported in Tables III.3 and III.4, it is useful to recall
that the scatterplots of ln y_t versus ln x_t in Chapter II reveal
that the least squares results were dominated (distorted) by
a few extreme or outlier observations.


III.4. Residuals


One of the interesting features of the Huber "M" approach
is that it provides an easy way to examine residuals. Recall
that the approach solves the following:

        \min_{\beta} \sum_{t=1}^{T} \rho\Bigl( y_t - \sum_{i=1}^{p} \beta_i x_{it} \Bigr)

where
                     \frac{1}{2} z^2                          for |z| < c\sigma
        \rho(z) =    c\sigma z - \frac{1}{2} (c\sigma)^2      for z \geq c\sigma
                    -c\sigma z - \frac{1}{2} (c\sigma)^2      for z \leq -c\sigma

and \sigma is a scale parameter.

We can label the residuals for which
|y_t - \sum_{i=1}^{p} \hat\beta_i x_{it}| < c\sigma as Group 1 residuals,
residuals for which y_t - \sum_{i=1}^{p} \hat\beta_i x_{it} \geq c\sigma
as Group 2 residuals and those for which
y_t - \sum_{i=1}^{p} \hat\beta_i x_{it} \leq -c\sigma as Group 3 residuals.
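
The grouping can be sketched in Python as below. The function name and
the trial residuals are hypothetical, and the treatment of a residual
exactly equal to +c\sigma or -c\sigma (assigned to Group 2 or Group 3) is an
assumption made to match the definition of \rho.

```python
def residual_group(residual, c, sigma):
    """Return 1, 2 or 3 according to the size and sign of a residual.

    Group 1: |residual| <  c * sigma
    Group 2:  residual  >= c * sigma   (appropriation well above the fit)
    Group 3:  residual  <= -c * sigma  (appropriation well below the fit)
    """
    k = c * sigma
    if residual >= k:
        return 2
    if residual <= -k:
        return 3
    return 1


if __name__ == "__main__":
    # Hypothetical residuals with c = 1.0 and sigma = 0.5.
    for r in (0.2, 0.5, 1.4, -0.1, -0.7):
        print(r, residual_group(r, c=1.0, sigma=0.5))
```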


III.4.1. Group 2 and Group 3 Residuals

The items or appropriations accounts which were in the
Group 2 and Group 3 residuals were approximately the same for
each of the different models (2.1) through (2.5). However,
the rank order of these residuals varied between approaches.
For example, 1955 Navy RDTE, where the request was $61 million
and appropriations were $419 million, produced the largest
positive or Group 2 residual for (2.2) through (2.5), and only
the third largest positive residual for (2.1). However, the
rank order of the Group 2 and Group 3 residuals is not nearly
as important as the order of the residuals when ordinary
least squares is used. When least squares is used, each
observation receives a weight proportional to
|y_t - \sum_{i=1}^{p} \hat\beta_i x_{it}| in the calculation of \hat\beta,
while when the Huber "M" approach is used, Group 2 and Group 3
residuals receive a weight proportional to c\sigma.

The following table gives, for c=1.0, the items corresponding
to Group 2 and Group 3 residuals common to all the models, (2.1)
through (2.5), and the budget request and final appropriations
for each. The tables reflect no particular rank order since
this varied from model to model.


Item and Year            REQUEST        APPROPRIATION
1954 PEMA                [illegible]    2226.6
1963 PAMN                3065.9         3834.7
1957 AF Aircraft         3859.9         [illegible]
1962 PEMA                1803.0         2532.6
1960 AF Missiles         [illegible]    2540.5
1962 PAMN                2000.0         2680.9
1959 PEMA                 970.0         1669.3
1960 AF Aircraft         2934.1         3497.2
1963 AF Aircraft         [illegible]    3562.4
1962 AF Aircraft         [illegible]    [illegible]

                    TABLE III.5

          Procurement Group 2 Residual Items


There were minor differences between the Group 3 residuals
for (2.1) and for the other equations. These involved two
items, not in the table, which are different in the set of
Group 3 residuals for (2.1).


Item and Year               REQUEST        APPROPRIATION
1954 AF Aircraft            4283.          2453.
1954 AF Missile             [illegible]    2574.
1954 PAMN                   1924.          [illegible]
1953 PEMA                   2544.          1889.
[illegible] AF Missile      [illegible]    1394.
1970 PAMN                   3235.          2620.
1969 PAMN                   2202.          2574.
[illegible] AF Aircraft     3259.          [illegible]
1970 PEMA                   5069.          [illegible]
1969 PEMA                   5626.          [illegible]
1969 AF Aircraft            4612.          3860.

                    TABLE III.6

          Procurement Group 3 Residuals

*Not part of Group 3 for (2.1).

**Because of the magnitude of the request and because the (2.4)
and (2.5) results imply that the percentage of the request
granted decreases as the request increases, these items do not
appear as Group 3 residuals for (2.4) and (2.5).


Note that the Group 2 residuals contain items from the early 
1960's and late 1950's except for 1954 PEMA. Air Force items 
are particularly noticeable in the Group 2 residuals. On the 
other hand, the Group 3 residuals are dominated by items from 
the early 1950's and post-1969 items. This is consistent with 
the discussion in the last chapter of a possible change in 


Congressional decision rules starting in 1969. 


Item             Request        Appropriations
1955 Navy        [illegible]     419.9
1960 AF          1302.0         [illegible]
1962 AF          1637.0         2403.2
1959 Navy        [illegible]    [illegible]
1957 AF           610.0         [illegible]

          Group 2 Residuals RDTE


The Group 2 residuals for (2.1) also contained more Air
Force items for 1961, 1963, and 1969.



Item             Request        Appropriations
1954 Army         475.0          345.0
1973 Army        [illegible]    1829.0
1970 Army        [illegible]    1596.8
1970 AF          [illegible]    3060.6
1973 Navy        2813.8         2545.3
1954 Navy        [illegible]      58.6
1954 AF          [illegible]    [illegible]

          Group 3 Residuals

*Not part of Group 3 for (2.4) and (2.5).
**Not part of Group 3 for (2.1).
The Group 3 residuals for (2.4) and (2.5) also contained
the item "1953 Navy", where the request was 75.7 and the
appropriations 70.0. Despite the fact that this was only a
5.7 million dollar cut, (2.4) and (2.5) imply that a small
request should generally be cut very little if at all. Thus
1953 Navy appears as a Group 3 residual for (2.4) and (2.5),
indicating the appropriation was much less than expected.

The Group 2 and Group 3 residuals for RDTE demonstrate 
patterns similar to those noted for procurement. That is, the 
late 1950's and early 1960's items are noticeable in the Group 
2 residuals, while the early 1950's and post-1969 items 
dominate the Group 3 residuals. 

In summary, the items producing the largest residuals when the Huber
"M" technique was used resemble those producing the largest
least squares residuals. However, the magnitude and distribu-
tion of the least squares residuals differ. For the RDTE
and procurement data analyzed, use of the Huber method allowed
the residuals to stand out more dramatically. This is because,
heuristically speaking, the Huber method does not try as hard
as least squares to avoid having a few very large residuals.


III.4.2. Distribution of Residuals


Both the least squares and the Huber "M" techniques assume
that the distribution of the stochastic disturbance is
symmetric about zero. One way of examining the appropriateness
of an estimator is to examine whether or not the residuals
which result when that estimator is used are symmetric or
nearly symmetric about zero. There are certainly many other
ways to probe whether or not the assumptions underlying the
estimation technique are satisfied, such as, for example, constructing
histograms, stem and leaf plots and making probability plots
[Wilk and Gnanadesikan, 1968]. However, the simple technique of
counting positive and negative residuals will quickly reveal
some of the basic information being sought. The following
tables, III.5 and III.6, show the number of negative residuals
as opposed to the total number of items for (2.1) through (2.5)
when the Huber "M" approach, with c=1.0, and when least squares
were used.


                                                  Huber M, c=1.0    Least squares
(2.1) y_t = \beta x_t + \epsilon_t                     33/80             33/80
(2.2) y_t = \beta x_t + \epsilon_t x_t                 35/80             42/80
(2.3) y_t = \beta x_t e^{u_t}                          36/80             42/80
(2.4) y_t = \beta x_t^{\alpha} e^{u_t}                 37/80             46/80
(2.5) y_t = \beta x_t^{\alpha} e^{u_t \ln x_t}         36/80             54/80

                    TABLE III.5

          Procurement: Number of negative residuals
            divided by total number of residuals


                                                  Huber M, c=1.0    Least squares
(2.1) y_t = \beta x_t + \epsilon_t                     [illegible]       28/63
(2.2) y_t = \beta x_t + \epsilon_t x_t                 32/63             51/63
(2.3) y_t = \beta x_t e^{u_t}                          30/63             48/63
(2.4) y_t = \beta x_t^{\alpha} e^{u_t}                 30/63             35/63
(2.5) y_t = \beta x_t^{\alpha} e^{u_t \ln x_t}         31/63             28/63

                    TABLE III.6

          RDTE: Number of negative residuals
            divided by total number of residuals


The results in Tables III.5 and III.6 reveal a few marked
differences between overall residual patterns resulting from
the application of least squares as opposed to the Huber "M"
estimator. For the procurement data, least squares applied
to (2.5) yields a large percentage of negative residuals,
14 more than one-half, while the Huber "M" yields a percentage
slightly smaller than one-half. A similar but not quite so
noticeable difference results for (2.4). On the other hand,
for (2.1)-(2.5) using the Huber "M" the fraction of negative
residuals is always less than one-half. For the RDTE data the
major differences between the two approaches, in terms of
positive-negative symmetry of the residuals, are for (2.2)
and (2.3), the location parameter problems. Least squares
applied to (2.2) yields an estimate for \beta which results in
57 of a possible 63 negative residuals as opposed to the
Huber "M" which yields 32 of a possible 63. For (2.3), least
squares results in 48 negative residuals while the Huber "M"
results in 30 negative residuals. The similarity between the
numbers of positive and negative residuals when least squares
and the Huber "M" are applied to (2.4) and (2.5) is somewhat
deceiving. An inspection of the individual items which
correspond to the positive and negative residuals reveals
marked differences between the two approaches. For example,
for (2.5) the residuals of a total of twenty-eight items had
signs which were different when the Huber "M" approach was
applied than the signs resulting when least squares was applied.
In order to look into this matter a little more closely, recall
first that for (2.4) and (2.5) the least squares residuals were
mostly negative (except for a few large positive ones) for small
values of log x_t and mostly positive for large values of log x_t.
See Figure III.2 for illustration.


[FIGURE III.2: residuals plotted against log x_t]
Thus despite the fact that the numbers of positive and
negative residuals were nearly equal, they bore an undesirable
relationship to x_t. As reported in Table III.6, the residuals
resulting from the application of the Huber "M" approach to
(2.4) and (2.5) are also symmetric around zero but do not show
a relationship to x_t like that in Figure III.2.
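
A simple way to look for such a relationship is to count negative
residuals separately for small and for large requests. The Python sketch
below is only an illustration: the split at the median request, the
function name and the six data points are assumptions and are not taken
from the budget data.

```python
import statistics


def negative_counts_by_size(x, residuals):
    """Count negative residuals among small and large requests.

    Requests at or below the median of x are 'small', the rest 'large'.
    Returns ((negatives, total) for small, (negatives, total) for large).
    """
    cut = statistics.median(x)
    small = [r for xi, r in zip(x, residuals) if xi <= cut]
    large = [r for xi, r in zip(x, residuals) if xi > cut]
    return ((sum(r < 0 for r in small), len(small)),
            (sum(r < 0 for r in large), len(large)))


if __name__ == "__main__":
    # Hypothetical requests and residuals.
    x = [10.0, 20.0, 40.0, 80.0, 160.0, 320.0]
    residuals = [-0.3, -0.2, 0.9, 0.1, -0.1, 0.4]
    print(negative_counts_by_size(x, residuals))
```

Roughly equal overall counts of positive and negative residuals can hide
very unequal counts in the two halves, which is the pattern described
above for the least squares fits of (2.4) and (2.5).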
III.5 Summary


This chapter commenced with a discussion of the problem
of robust estimation when a linear regression can not be
reduced to a location parameter problem and when one is not
prepared to pay for the lack of efficiency of the sample
median when the error distribution is normal in the location
parameter problem. The need for a robust estimation
technique was especially evident in Chapter II upon
examination of scatterplots, least squares coefficient
estimates, and residual patterns for the models (2.4)
and (2.5), and, to a lesser extent, for (2.1). The Huber "M"
estimator has been proposed and utilized as a robust alternative
to least squares. The logical basis for the estimator, and
its relationship to maximum likelihood estimation, were
discussed. Section III.3 contains a comparison of Huber "M"
and least squares estimates for the coefficients in (2.1)
through (2.5) for both procurement and RDTE data. The results
were noticeably different for (2.4) and (2.5) both for
procurement and RDTE and for (2.2) and (2.3) for RDTE data.

The final section, prior to this summary, contains a discussion
of the Group 2 and Group 3 residuals produced by the Huber "M"
approach and a comparison of the distribution of the residuals
under least squares and Huber "M" estimation.

At this point a short assessment of what has been
accomplished is in order. The Huber "M" approach yielded
results, for (2.4) and (2.5) for procurement and RDTE and for
(2.2) and (2.3) for RDTE, which appear to be more reasonable,
substantively, and more consistent with the assumptions
underlying the estimation technique than those produced by
least squares. However, the implication for the defense
budget question is possibly even broader. No claim has been
made in this analysis that Congressional action on defense
budgets is completely deterministic and predictable. Rather,
the analysis has been based, more or less implicitly up to
this point, on the view that embedded in the process of
Congressional consideration are certain regular elements which
resemble the outcomes which occur when simple rules of thumb
are applied and certain irregular and unusual elements which
do not appear to be the product of the application of fixed
decision rules or simple rules of thumb (or may be the result
of the application of different decision rules). The robust
regression approach, through the specification of Group 2 and
Group 3 residuals, has allowed us to distinguish or separate
out these two sets of elements. One might argue that
analysis of least squares residuals would do the same thing.
However, the robust regression approach has in some sense
diminished the contamination of the estimation of what is
going on in the fixed or regular area by the data generated
by unusual or irregular occurrences. Such a contamination
can be considerable when least squares is used.

Several questions still remain unanswered. For example,
how stable are the coefficient estimates, both least squares
and Huber "M" estimates? That is, can reasonable confidence
limits be placed on estimated coefficients? How do (2.1)
through (2.5) compare among themselves in terms of stability
of estimates and prediction intervals? It may not be possible
to answer these questions by only re-analyzing the data at
hand, since the theory underlying interval estimation is not
well-developed for the Huber "M". Thus, data might have to
be constructed which provide a standard against which to
compare results for the data at hand. One reasonable procedure
is to synthesize such data, i.e. by sampling from a suitable
disturbance distribution.

Several questions have been addressed and partially
answered, but need further analysis. What can be said about
what appear to be the "irregular" elements of the process?
Are they completely inexplicable or are there as many different
explanations as there are different budget items and years?
Certain observations have already been made, such as the
post-1969 "cutting" mood of the Congress and the generosity
of the Congress to the Air Force between 1957 and 1969.
Possibly some models are more appropriate for parts of the
data than others. These and other questions will be addressed
in the next two chapters.




APPENDIX A 


The Mean and Median as Location Parameter Estimators 


A.1. The Wild Shot Problem


The classic location parameter problem is one with the
following form:

(1)     y_t = \theta + \epsilon_t

where \theta is a location parameter and the \epsilon_t are independent
and identically distributed, E(\epsilon_t) = 0 and var(\epsilon_t) = \sigma^2. The
Gauss-Markov theorem [Scheffé, 1959] says that the least squares
estimator, in this case the sample mean, is the best (minimum
variance) unbiased estimator which is also linear in the
observations. If the \epsilon_t are N(0,\sigma^2), then the least squares
estimator is a best (minimum variance) estimator.
Unfortunately the sample mean is susceptible
to extreme observations, since the influence of an observation
on the least squares estimator, \hat\theta, is proportional to the
absolute difference |y_t - \hat\theta|. The following example
demonstrates this point.

If the \epsilon_t are N(0,1) then for a sample of 10 observations
the variance of \bar{y} is 0.1. However, suppose
that the errors, \epsilon_t, are of the following form:

                         \eta_1 \sim N(0,1)          with probability p
(2)     \epsilon_t =
                         \eta_2 \sim N(0,\sigma^2)   with probability q = 1-p

The density for \epsilon_t then becomes

(3)     f_{\epsilon}(x) = \frac{p}{\sqrt{2\pi}} e^{-x^2/2} + \frac{q}{\sqrt{2\pi}\,\sigma} e^{-x^2/(2\sigma^2)}

In this case, E(\epsilon_t) = 0, and

(4)     Var(\epsilon_t) = E(\epsilon_t^2) = p + q\sigma^2

Suppose p = 0.9, q = 0.1 and \sigma^2 = 10. Then, on the average,
one observation out of 10 comes from a distribution with
large variance. However, using (4), var(\epsilon_t) is equal to

        E(\epsilon_t^2) = (0.9)(1) + (0.1)(10) = 1.9

and the var(\bar{y}) equals 0.19. Note what has happened: when
one out of ten observations comes, on the average, from a
contaminating distribution, the variance of the estimator, \bar{y},
nearly doubles.
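
The arithmetic of this example can be reproduced with a short Python
sketch. The function names are illustrative assumptions; the formula is
(4), with the uncontaminated component taken to have unit variance as in
(2).

```python
def contaminated_variance(p, sigma2):
    """Variance of the mixture in (2): p * 1 + (1 - p) * sigma2, as in (4)."""
    return p * 1.0 + (1.0 - p) * sigma2


def variance_of_mean(p, sigma2, n):
    """Variance of the sample mean of n independent draws from the mixture."""
    return contaminated_variance(p, sigma2) / n


if __name__ == "__main__":
    print(variance_of_mean(1.0, 10.0, 10))   # no contamination
    print(variance_of_mean(0.9, 10.0, 10))   # one wild shot in ten, on average
```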
A.2. The Median as a Location Parameter Estimate:
The Normal Case

The median is an alternative estimator to the mean in
location parameter problems when disturbance distributions
are symmetric. It can be shown that as the sample size, n,
gets large, then for a distribution that is symmetric around
zero, the following statement can be made about \tilde{y}, the
sample median [David, 1970]:

        E(\tilde{y}) = 0
(5)
        var(\tilde{y}) \approx \frac{1}{4n (f(0))^2}

where f is the density from which a sample is being taken.
Thus if \epsilon_t is N(0,1), then

        var(\tilde{y}) \approx \frac{\pi}{2n}    (as n gets large)

while   var(\bar{y}) = \frac{1}{n} .

Thus the median has an efficiency, with respect to the
mean, of roughly 2/\pi = 0.64. In other words, if the \epsilon_t are
N(0,1), then it will take 1.57 times as many observations when
using the sample median as opposed to the sample mean in
order to achieve the same variance for the estimator.
A.3. The Mean Versus the Median: The Wild Shot Case


The relationship between the variance of the sample median
and the variance of the sample mean reverses when the \epsilon_t are
not normally distributed but are distributed according to (2).

The variance of the sample median, when the \epsilon_t are
distributed according to (2), is approximately given by

(6)     var(\tilde{y}) \approx \frac{2\pi}{4n (p + q/\sigma)^2}

If p = 0.9, q = 0.1 and \sigma^2 = 10 then

        var(\tilde{y}) \approx \frac{1.81}{n}

Thus for n = 100, var(\tilde{y}) \approx .0181 and var(\bar{y}) = .019.
This "robust" behavior of the median as compared to the
mean is noticeable in the case of many long-tailed error
distributions [Andrews, et al., 1972].
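
The comparison can be checked numerically with the Python sketch below,
which evaluates the large-sample formula var(median) = 1/(4n f(0)^2) with
f(0) taken from the mixture density (3). The function names are
illustrative assumptions. With the factor (p + q/\sigma) squared, as the
formula requires, the case p = 0.9, \sigma^2 = 10 gives roughly 1.81/n.

```python
import math


def density_at_zero(p, sigma2):
    """Value at zero of the mixture density (3): (p + q/sigma) / sqrt(2*pi)."""
    q = 1.0 - p
    return (p + q / math.sqrt(sigma2)) / math.sqrt(2.0 * math.pi)


def median_variance(p, sigma2, n):
    """Large-sample variance of the sample median: 1 / (4 n f(0)^2)."""
    f0 = density_at_zero(p, sigma2)
    return 1.0 / (4.0 * n * f0 * f0)


def mean_variance(p, sigma2, n):
    """Variance of the sample mean under the mixture: (p + q*sigma2) / n."""
    return (p + (1.0 - p) * sigma2) / n


if __name__ == "__main__":
    n = 100
    print(median_variance(1.0, 1.0, n), mean_variance(1.0, 1.0, n))    # normal case
    print(median_variance(0.9, 10.0, n), mean_variance(0.9, 10.0, n))  # wild shots
```

In the pure normal case the median is the less precise estimator; with
one wild shot in ten the ordering reverses, though by a modest margin.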
APPENDIX B


The Huber "M" Estimator as a
Maximum Likelihood Estimator


B.1. Basic Concepts and the Normal Case as An Example


For the discussion that follows, assume that what
is being sought is an estimate for \beta in the model

(1)     y_t = \beta x_t + \epsilon_t

where \epsilon_t is a random variable and the \epsilon_t's are independent and
identically distributed. The following discussion can be
generalized to the case in which \beta and x_t are vectors. Let
the density for \epsilon be p(y - \beta x). If one observes y_1, y_2, ..., y_T
and x_1, x_2, ..., x_T then the log likelihood of the sample is given
by

        L = \sum_{t=1}^{T} \log p(y_t - \beta x_t)

A maximum likelihood estimate for \beta, \hat\beta, is an estimate
which maximizes the likelihood, which is equivalent to
maximizing L. The necessary condition for a maximum yields
the equation

(4)     \frac{\partial L}{\partial \beta} = -\sum_{t=1}^{T} x_t \frac{p'(y_t - \beta x_t)}{p(y_t - \beta x_t)} = 0

or

        \sum_{t=1}^{T} x_t \phi(y_t - \beta x_t) = 0 ,   where   \phi = -\frac{p'}{p} .

The maximum likelihood estimate, \hat\beta, satisfies equation
(4). We recognize the possibility of multiple roots, but
disregard the complication for the present.

If the \epsilon_t are assumed to be N(0,1) then \phi(z) = z.
This is easily seen since

        p(z) = \frac{1}{\sqrt{2\pi}} e^{-z^2/2}

        p'(z) = -\frac{z}{\sqrt{2\pi}} e^{-z^2/2}

so that -p'(z)/p(z) = z, and (4) becomes

        \sum_{t=1}^{T} x_t (y_t - \hat\beta x_t) = 0

or

        \hat\beta = \frac{\sum x_t y_t}{\sum x_t^2} ,  which is also the least squares
estimator for \beta.

B.2. A Distribution That is Implied by the Huber "M" Estimator


The Huber "M" approach replaces \phi(z) = z by

                    -c     for z \leq -c
(5)     \phi(z) =    z     for |z| < c
                     c     for z \geq c

It is possible to find a distribution for which the
likelihood is maximized when the following equation (6) is
solved for \beta and when certain restrictions, such as continuity,
are placed on the distribution.

(6)     \sum_{t=1}^{T} x_t \phi(y_t - \beta x_t) = 0

where \phi is as specified in (5).

The density function (and distribution) is defined over
three disjoint regions of interest: (-\infty, -c], (-c, +c), [c, \infty).
Call these Regions 1, 2, and 3 respectively.
For these regions, the specification of \phi in (5) implies the
following differential equations, where p_i refers to a density
defined over region i.

        \frac{p_1'(z)}{p_1(z)} = c

        \frac{p_2'(z)}{p_2(z)} = -z

        \frac{p_3'(z)}{p_3(z)} = -c

The solutions to these equations are p_1(z) = K_1 e^{cz},
p_2(z) = H e^{-z^2/2} and p_3(z) = K_2 e^{-cz}. If one desires the distribu-
tion which is being derived to be symmetric around zero, then
K_1 = K_2. Call this constant K. Also, to simplify matters, let
K = K^* c, so that p_1(z) = K^* c e^{cz} and p_3(z) = K^* c e^{-cz}. The constants
H and K^* must satisfy certain conditions. First,

(7)     \int_{-\infty}^{-c} K^* c e^{cz} dz + \int_{-c}^{c} H e^{-z^2/2} dz + \int_{c}^{\infty} K^* c e^{-cz} dz = 1

Next, it is attractive (but not essential) that the
density p, which is being derived, be continuous. This will
be the case if

(8)     p_1(-c) = p_2(-c)

(9)     p_2(c) = p_3(c)

Equations (8) and (9) are not independent. However, (7) and
(8) are. Consequently they can be used to derive K^* and H.
First, using equation (8) it follows that

        K^* c e^{-c^2} = H e^{-c^2/2}

and

(10)    K^* = \frac{H}{c} e^{c^2/2}

Substituting in equation (7) yields the following

        \frac{H}{c} e^{-c^2/2} + H \int_{-c}^{c} e^{-z^2/2} dz + \frac{H}{c} e^{-c^2/2} = 1

(11)    \frac{2H}{c} e^{-c^2/2} + H \int_{-c}^{c} e^{-z^2/2} dz = 1

The second term in equation (11) can not be put into
closed form so that a closed form expression for H can be
stated. However, it is easy to obtain an approximate value
for \int_{-c}^{c} e^{-z^2/2} dz by using the standard normal tables to
get the area under the standard normal density between -c and
+c and then multiplying this value by \sqrt{2\pi}.
Let us call the area \Phi(c) - \Phi(-c). Using this, H is found
to be

(12)    H = \frac{c}{2 e^{-c^2/2} + \sqrt{2\pi}\, c \,( \Phi(c) - \Phi(-c) )}
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Returning to the original problem, the Huber "M" approach 


yields a maximum likelihood estimator for B when the density 


of E, is of the form 
K et ce Z<-c 
Re Oy 
(13) Page 4 He 7 lz] < ec 
eres ue AS ie 


where K = K%*ec and K* and H are found using (11) and (12). 

This solution assumes that any scale parameter is already
implicit in $c$. If a scale parameter also needs to be estimated,
the formulas change slightly; however, the basic results remain
the same: $p(z)$ is like a normal density between $-c$ and $+c$ but
has "fatter," exponential tails than a normal density beyond
$-c$ and $+c$.
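The constants in (10) and (12) and the resulting density (13) can be checked numerically. The following is a minimal sketch, not part of the original derivation: the function names are hypothetical, and the normal-table area $\Phi(c) - \Phi(-c)$ is obtained from the error function rather than from printed tables.

```python
import math


def huber_density_constants(c):
    """Return (H, K_star) following equations (12) and (10)."""
    # Phi(c) - Phi(-c) for the standard normal, via the error function.
    normal_area = math.erf(c / math.sqrt(2.0))
    H = c / (2.0 * math.exp(-c * c / 2.0)
             + c * math.sqrt(2.0 * math.pi) * normal_area)
    K_star = (H / c) * math.exp(c * c / 2.0)
    return H, K_star


def huber_density(z, c):
    """Density (13): normal-shaped centre, exponential tails."""
    H, K_star = huber_density_constants(c)
    K = K_star * c
    if z < -c:
        return K * math.exp(c * z)
    if z > c:
        return K * math.exp(-c * z)
    return H * math.exp(-z * z / 2.0)


def total_mass(c):
    """Left-hand side of (7), evaluated in closed form."""
    H, K_star = huber_density_constants(c)
    tails = 2.0 * K_star * math.exp(-c * c)   # the two exponential tails
    middle = H * math.sqrt(2.0 * math.pi) * math.erf(c / math.sqrt(2.0))
    return tails + middle


if __name__ == "__main__":
    print(huber_density_constants(1.0), total_mass(1.0))
```

For any positive $c$ the total mass should come out as one, and the density should be continuous at $\pm c$; the value of $H$ is smaller than $1/\sqrt{2\pi}$ because some probability is moved into the tails.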

Of course, the distribution defined in (13) is not the
only distribution implied by the Huber "M" estimator. For
example, if one wanted $p(z)$ to behave exactly like a normal
density between $-c$ and $+c$, $H$ could be made equal to
$1/\sqrt{2\pi}$, in which case $p(z)$ would not be continuous unless
$c$ took a value which allowed (12) to be satisfied.

APPENDIX C

Algorithm for Deriving Huber "M" Estimates
and a Proof of Optimality

C.1. Iterative Algorithm for Deriving Huber "M" Estimates


In Appendix B it was shown that the Huber "M" estimator
is a maximum likelihood estimator. It is also true that the
Huber "M" estimator is the solution to the problem

(1) $\min_{\beta} Z(\beta) = \sum_{t=1}^{T} \rho(y_t - \beta x_t)$

where $\rho(z) = \begin{cases} -cz - \tfrac{1}{2}c^2 & z < -c \\ \tfrac{1}{2}z^2 & |z| \le c \\ cz - \tfrac{1}{2}c^2 & z > c \end{cases}$

Notice that (1) is a modified least squares function.
However, intuitively, the influence or effect on the estimate,
$\hat\beta$, of observations for which $|y_t - \beta x_t| > c$ is not
proportional to $|y_t - \beta x_t|$ as in the least squares case.

The procedures for solving (1) which have been proposed
are iterative procedures where an estimate of $\beta$ at step $i$
is used to obtain a new estimate at step $i+1$ [Huber, 1973;
Andrews, 1973]. One of these algorithms [Huber, 1973] will
be discussed in detail in this appendix.

The algorithm requires the following steps.


(a) Start with an initial value of $\beta$, for example the least
squares estimate; call this $\beta^{(1)}$. The $i$-th estimate is
$\beta^{(i)}$ ($i = 1, 2, \ldots$).

(b) Based on the current estimate of $\beta$, partition the
observations into three groups, $T_1^{(i)}$, $T_2^{(i)}$, $T_3^{(i)}$:

(2) Group $T_1^{(i)} = \{t : |y_t - \beta^{(i)} x_t| \le c\}$
    Group $T_2^{(i)} = \{t : y_t - \beta^{(i)} x_t < -c\}$
    Group $T_3^{(i)} = \{t : y_t - \beta^{(i)} x_t > c\}$

If the partition is identical to that achieved for $\beta^{(i-1)}$
then stop. Use $\beta^{(i)}$ as the estimate for $\beta$. Otherwise go to
step (c).

(c) Minimize, if possible, the function $Z_i^*(\beta)$ with respect
to $\beta$, where $Z_i^*$ is defined like $Z$ except that the partition
$(T_1^{(i)}, T_2^{(i)}, T_3^{(i)})$ is here fixed, or in other words is
not allowed to change as $\beta$ changes. Call the solution
$\beta^{(i+1)}$. Go to step (b).
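Steps (a) through (c) can be sketched in a few lines for the single-coefficient model $y_t = \beta x_t + \epsilon_t$. This is an illustrative sketch only, not the program used for the thesis: the function names are hypothetical, group 1 is taken to be the central group ($|y_t - \beta x_t| \le c$), and the closed-form solution of step (c) comes from setting the derivative of the fixed-partition objective to zero.

```python
def partition(x, y, beta, c):
    """Step (b): label each observation 1 (|r| <= c), 2 (r < -c) or 3 (r > c)."""
    groups = []
    for xt, yt in zip(x, y):
        r = yt - beta * xt
        if r < -c:
            groups.append(2)
        elif r > c:
            groups.append(3)
        else:
            groups.append(1)
    return groups


def huber_m_slope(x, y, c, max_iter=100):
    """Iterate steps (b) and (c) until the partition stops changing."""
    # Step (a): least squares start for the model y = beta * x.
    beta = sum(xt * yt for xt, yt in zip(x, y)) / sum(xt * xt for xt in x)
    previous = None
    for _ in range(max_iter):
        groups = partition(x, y, beta, c)
        if groups == previous:
            return beta  # stopping rule of step (b)
        sxx_mid = sum(xt * xt for xt, g in zip(x, groups) if g == 1)
        if sxx_mid == 0.0:
            # Central group empty: the fixed-partition problem is unbounded.
            raise ValueError("no observation in the central group")
        # Step (c): solve
        #   sum_{T1} x(y - beta x) - c * sum_{T2} x + c * sum_{T3} x = 0.
        numerator = 0.0
        for xt, yt, g in zip(x, y, groups):
            if g == 1:
                numerator += xt * yt
            elif g == 2:
                numerator -= c * xt
            else:
                numerator += c * xt
        beta = numerator / sxx_mid
        previous = groups
    raise RuntimeError("partition did not stabilise")


if __name__ == "__main__":
    # Made-up data with one large positive residual.
    print(huber_m_slope([1, 2, 3, 4, 5], [1.1, 1.9, 3.2, 3.8, 9.0], c=1.0))
```

On the made-up data above the least squares slope is pulled upward by the last point, and the iteration settles after two updates on a smaller slope at which the partition no longer changes.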
C.2. Some Properties of the Huber Algorithm

Very little is said about the algorithm in [Huber, 1973].
The literature on robust regression apparently does not
contain statements concerning the characteristics of the
algorithm, except that it appears to work. Two questions
which will be addressed here are: Is the stopping rule a
true optimality condition? Does the algorithm achieve a
global optimum or can it stop at a local optimum which may
be far away from the global solution?

*In step (c), the only problem that could arise is if a minimum
of $Z_i^*(\beta)$ cannot be determined, that is, if $T_1^{(i)}$ is empty,
in which case the solution to the minimization of $Z_i^*$ is unbounded.


Proposition: The function $Z$ is differentiable with respect
to $\beta$, and

$Z'(\beta) = -\sum_{t} x_t\,\rho'(y_t - \beta x_t) = -\sum_{t} x_t\,\psi(y_t - \beta x_t)$

where

$\psi(y_t - \beta x_t) = \begin{cases} -c & \text{if } y_t - \beta x_t < -c \\ y_t - \beta x_t & \text{if } |y_t - \beta x_t| \le c \\ +c & \text{if } y_t - \beta x_t > c \end{cases}$

Proof: This is an immediate consequence of the chain rule,
since $Z$ is a sum of composites of differentiable functions.


The next result shows that satisfaction of the algorithm's
stopping condition automatically satisfies a necessary
condition for a minimum to $Z(\beta)$.

Proposition: When the stopping condition is invoked, then
$Z'(\beta) = 0$.

Proof: Let $T^{(i)} = (T_1^{(i)}, T_2^{(i)}, T_3^{(i)})$. The stopping condition
says $T^{(i+1)} = T^{(i)}$. Recall that $\beta^{(i+1)}$ is the $\beta$ which
minimizes $Z_i^*(\beta)$. Thus

(3) $\sum_{t \in T_1^{(i)}} x_t\,(y_t - \beta^{(i+1)} x_t) + \sum_{t \in T_2^{(i)}} (-x_t c) + \sum_{t \in T_3^{(i)}} x_t c = 0$

If $T^{(i+1)} = T^{(i)}$ then

(4.a) $\sum_{t \in T_1^{(i+1)}} x_t\,(y_t - \beta^{(i+1)} x_t) + \sum_{t \in T_2^{(i+1)}} (-x_t c) + \sum_{t \in T_3^{(i+1)}} x_t c = 0$

and

(4.b) for $t \in T_2^{(i+1)}$: $y_t - \beta^{(i+1)} x_t < -c$
      for $t \in T_1^{(i+1)}$: $|y_t - \beta^{(i+1)} x_t| \le c$
      for $t \in T_3^{(i+1)}$: $y_t - \beta^{(i+1)} x_t > c$

The left hand side of (4.a), together with the inequalities
in (4.b), corresponds (up to sign) to the definition of $Z'(\beta)$,
while equation (4.a) itself says $Z'(\beta) = 0$.

Since $Z'(\beta) = 0$ when the stopping condition is invoked,
the necessary condition for a minimum to $Z(\beta)$ is satisfied.

If satisfaction of the algorithm's stopping condition
satisfies not only necessary but also sufficient conditions
for a minimum of $Z(\beta)$, then it is not necessary to worry
about potential local minima or stationary points when the
stopping condition is satisfied. The approach to proving
sufficiency is based on proof of an interesting characteristic
of $Z(\beta)$. This characteristic and its proof not only
are useful for the present analysis, but also indicate a
type of function $\rho$ for which necessary and sufficient
conditions for a minimization like (1) can be stated for an
algorithm like Huber's.

Proposition: $Z(\beta)$ is convex in $\beta$.

Proof: By the definition of $\rho(z)$, $\rho$ is convex in $z$. Thus
$\rho(a + bz)$, where $a$ and $b$ are constants, is also convex in $z$.
Each term $\rho(y_t - \beta x_t)$ is of this form when regarded as a
function of $\beta$, and a sum of convex functions is convex, so
$Z(\beta)$ is convex in $\beta$.

Theorem: When the stopping condition is satisfied, necessary
and sufficient conditions for a minimum are satisfied.

Proof: The theorem is true since $Z'(\beta) = 0$ is a necessary
and sufficient condition for the minimization of a convex
differentiable function.

CHAPTER IV


In the previous two chapters, defense budget data were
analyzed using equations which were designed to describe or
represent general tendencies and characteristics of Congressional
appropriations action, as a function of requests. In
addition, some attempt was made to identify situations in
which these tendencies or characteristics are not found or
present. Different techniques were employed in estimating
the coefficients in these equations, some robust and others
not so robust. In Chapters II and III, coefficient estimators and
residuals were compared both for analysis using least squares
and analysis using Huber "M" estimates. In Chapter V,
the implications of the patterns of the residuals will be
discussed further. However, before that discussion begins
the discussion of estimation begun in the previous chapter
will be completed.

Up to this point our attention has focused on obtaining
point estimates. The purpose of this chapter is to expand
on the work of earlier chapters by deriving some measures of
variability or stability of the point estimates and by estimating
confidence intervals for the coefficients in equations (2.1)
through (2.5). The first section of this chapter poses the
problem of determining the variance of an estimator. In
Section two, the jackknife is proposed as a possible solution
to this problem. Finally, results of jackknifing least
squares and Huber "M" estimates are summarized and discussed.
The appendices contain discussions of the problem of biasedness
of estimators for $\beta$ in equations like (2.3), and of the
rationale behind probability plotting, together with probability
plots for jackknifed estimates.

IV.1. The Problem


IV.1.1. Background 


The basic questions that are addressed in this chapter
are the result of the fact that estimators (least squares,
Huber, etc.) are modelled as random variables.

Since estimators are random variables, important questions
arise concerning their properties. What is the expected
value, what is the variance, what is the distribution of a
particular estimator? If the expectation of the estimator
equals the true value of the parameter being estimated, the
estimator is said to be unbiased [Kendall and Stuart, I,
1963, p. 222]. If as the sample size gets large the estimator
converges in probability to a degenerate random variable which
equals with probability one the value of the parameter being
estimated, then the estimator is said to be consistent
[Kendall and Stuart, II, 1951, p. 3]. Different estimators of
the same parameter are often compared by examining their
variance. If the estimators are unbiased, this is done by
computing their relative efficiency [Kendall and Stuart, II,
1951, pp. 5-6], which is defined as the ratio of the variances
of the estimators. Numerous other characteristics of
estimators become important when the estimation problem is
put into a decision theoretic context [Ferguson, 1967].
Unfortunately, it is not always possible to obtain an
unbiased, consistent, efficient estimator, and sometimes even
when one is found, these properties are sensitive to certain
assumptions made about the problem (such as the normality
of $\epsilon_t$). The wild shot problem of Appendix A of Chapter III
gives an example of an estimator which suffers when certain
assumptions are not met.

If there are competing estimators — and competing problem 
formulations, such as (2.1) through (2.5) — it is useful to 
ask and attempt to answer certain questions. Is the variance 
of the estimator large? Consequently, the question of how 
this variance should be estimated arises. Another question 
is associated with the distribution of the estimator. Is 
it normal? What about large sample behavior? 

Not all of these questions will be answered fully in what
follows. However, estimates will be derived for the standard
error of the estimated coefficients, both when using the
Huber "M" approach and least squares. This will allow us to
construct interval estimates for $\beta$ (and $\alpha$ where applicable)
in (2.1) through (2.5).


IV.1.2. The Problem in the Least Squares Case 


The problem that is of interest at this point is twofold.
First there is the question of the bias of the estimator;
secondly, there is the question of how to estimate its
stability, as measured by the variance.

Least squares applied to (2.1) yields an unbiased
estimator for $\beta$. This is shown in most standard textbooks
[Johnston, 1963, p. 15]. Also, least squares applied to (2.2),
after dividing through by $x_t$, yields an unbiased estimator
for $\beta$. However, there are some problems when estimating
comparable parameters in (2.3) through (2.5). In each case
logarithms are taken and $\ln\beta$ is estimated. The least
squares method applied to the transformed equations yields
unbiased and maximum likelihood estimators (assuming
independent normal residuals) for $\ln\beta$ and $\alpha$. However, as
shown in Appendix C, the estimator for $\beta$ given by
$\hat\beta = e^{\widehat{\ln\beta}}$ is not unbiased.
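A standard lognormal calculation suggests why exponentiating an unbiased estimate of $\ln\beta$ overshoots on average. The lines below are a sketch under one stated assumption, that the estimator of $\ln\beta$ is exactly normal with mean $\ln\beta$ and variance $\sigma^2$ (the symbol $\sigma^2$ is introduced here only for illustration); it is not necessarily the argument given in the appendix.

```latex
\widehat{\ln\beta} \sim N(\ln\beta,\ \sigma^2)
\quad\Longrightarrow\quad
E\!\left[e^{\widehat{\ln\beta}}\right]
  = e^{\ln\beta + \sigma^2/2}
  = \beta\, e^{\sigma^2/2} \;>\; \beta
\qquad (\sigma^2 > 0).
```

Under this assumption the upward bias grows with the variance of the estimate of $\ln\beta$, so it matters most where that coefficient is poorly determined.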

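When the least squares method is applied to (2.1), the
variance of the estimator for $\beta$, if it exists, given in
most standard texts is:

(4.1) $\sigma_{\beta}^2 = \dfrac{\sigma_{\epsilon}^2}{\sum_{t=1}^{T} x_t^2}$

where $\sigma_{\epsilon}^2$ is the variance of $\epsilon_t$.

One method of estimating $\sigma_{\beta}^2$ for (2.1) is to estimate $\sigma_{\epsilon}^2$
by

(4.2) $\hat\sigma_{\epsilon}^2 = \dfrac{\sum_{t=1}^{T} \hat\epsilon_t^2}{T - 1}$

where $\hat\epsilon_t = y_t - \hat\beta x_t$, and substitute $\hat\sigma_{\epsilon}^2$ for $\sigma_{\epsilon}^2$ in (4.1).
Similar formulas can be derived for var($\hat\beta$) in (2.2) and
var($\hat\alpha$) in (2.4) and (2.5).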
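The textbook variance estimate for the one-coefficient model $y_t = \beta x_t + \epsilon_t$ can be sketched as follows. The function name is hypothetical, and dividing the residual sum of squares by $T - 1$ (one estimated coefficient) is an assumed degrees-of-freedom convention.

```python
def ls_through_origin(x, y):
    """Least squares slope for y = beta * x and its estimated variance."""
    T = len(x)
    sxx = sum(xt * xt for xt in x)
    beta_hat = sum(xt * yt for xt, yt in zip(x, y)) / sxx
    residuals = [yt - beta_hat * xt for xt, yt in zip(x, y)]
    # Residual variance with T - 1 degrees of freedom (assumed convention).
    sigma2_eps = sum(e * e for e in residuals) / (T - 1)
    # Variance of the slope: residual variance over the sum of squared x.
    var_beta = sigma2_eps / sxx
    return beta_hat, var_beta


if __name__ == "__main__":
    print(ls_through_origin([1.0, 2.0, 3.0], [2.0, 4.0, 6.6]))
```

The square root of the second returned value is the estimated standard error of the slope.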
It is possible to derive an expression or estimator for
$\beta$ which for large $T$ is nearly unbiased and also to derive an
approximate expression for the variance of the estimator
as $T$, the sample size, gets large. The method is the Tukey-
Quenouille-Miller jackknife, and it will be discussed in
Section IV.2.


If the $\epsilon_t$ (or $u_t$) are assumed to be normal, then $\hat\beta$ in
(2.1) and (2.2), $\ln\hat\beta$ for (2.3) through (2.5) and $\hat\alpha$ in (2.4)
and (2.5) are also normal. However, if this assumption is
not made, the estimators are in general not normally
distributed. Asymptotic normality can be shown to hold
for the jackknifed estimators to be discussed in Section IV.2
[Miller, 1974].

IV.1.3. The Problem in the Huber "M" Case


Small sample properties of the Huber "M" estimators have
not been determined analytically. Most studies comparing
the properties of these estimators with those of least
squares estimators have employed Monte Carlo techniques
[Huber, 1972; Huber, 1973; Andrews et al., 1972].

Huber has derived asymptotic results which show that
the "M" estimators are asymptotically normal under certain
mathematical conditions. Approximate expressions for the
variance of the estimator as the sample size, $T$, gets large
have also been derived [Huber, 1973].* Nevertheless, existing
mathematical results give no precise guide to action in the
finite sample case except to use a finite sample version of
the asymptotic variance. An especially thorny problem is
what to use as an estimate for var($\hat\beta$) in (2.3) through (2.5).

In the next section the jackknife will be proposed as a
method of obtaining estimates of the variances of estimates
for $\beta$ and $\alpha$. In the case of least squares the appropriateness
of the jackknife has been established; however, the same is
not true for the Huber "M". This question of the appropriateness
of jackknifing Huber "M" estimates will be addressed in
Chapter V.

*See Appendix D.

IV.2. The Jackknife


The jackknife is a technique which was originally proposed 
by Quenouille [1949] for reducing bias in estimation problems. 
Tukey extended the use of the technique by pointing out that 
it could be used to obtain approximate confidence intervals 
in situations for which standard statistical procedures do 
not exist, or are difficult to apply. Miller [1964] analyzed 
the utility of the jackknife for obtaining confidence intervals 
and presented examples to illustrate occasions when such a use 
is not appropriate. He also showed the conditions under which 
some jackknife estimators (jackknifed transformations of the 
sample mean) are asymptotically normal. Recently, the properties
of jackknifed least squares estimators have been analyzed
[Miller, 1974].

The jackknife procedure is based on dividing the data into
groups, obtaining estimates from combinations of the groups
and then averaging the estimates. More specifically, in its
use in this analysis, a jackknifed estimator is formed in
the following way. First, successively leave out one
observation and compute

$\tilde\beta_t = T\hat\beta - (T-1)\,\hat\beta_{-t}$

where $\hat\beta$ is the estimate when all the data are used, and $\hat\beta_{-t}$ is
the estimate when all the data but the $t$-th observation are
used. $\tilde\beta_t$ is an estimate called the $t$-th pseudo-value. Next
compute the average of the pseudo-values, designated by $\tilde\beta$:

(4.3) $\tilde\beta = \dfrac{1}{T} \sum_{t=1}^{T} \tilde\beta_t$

$\tilde\beta$ is the jackknifed estimator. Tukey has been reported as
suggesting that under certain conditions the pseudo-values are
approximately independent and that

(4.4) $\hat\sigma_{\beta}^2 = \dfrac{\sum_{t=1}^{T} (\tilde\beta_t - \tilde\beta)^2}{T(T-1)}$

would be an appropriate estimate for the variance of $\tilde\beta$ [Miller, 1964].
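The leave-one-out construction, (4.3) and (4.4) translate directly into code. The following is a minimal sketch; the function name `jackknife` and the use of the sample mean as the example estimator are illustrative choices, not part of the original analysis.

```python
def jackknife(estimator, data):
    """Return (jackknifed estimate, variance estimate, pseudo-values).

    `estimator` is any function mapping a list of observations to a number.
    """
    T = len(data)
    full = estimator(data)
    pseudo_values = []
    for t in range(T):
        leave_one_out = data[:t] + data[t + 1:]
        # t-th pseudo-value: T * (full estimate) - (T - 1) * (estimate without t).
        pseudo_values.append(T * full - (T - 1) * estimator(leave_one_out))
    jackknifed = sum(pseudo_values) / T                      # (4.3)
    variance = sum((p - jackknifed) ** 2
                   for p in pseudo_values) / (T * (T - 1))   # (4.4)
    return jackknifed, variance, pseudo_values


def sample_mean(values):
    return sum(values) / len(values)


if __name__ == "__main__":
    print(jackknife(sample_mean, [1.0, 2.0, 3.0, 4.0]))
```

For the sample mean the pseudo-values reproduce the observations themselves, so the jackknifed estimate equals the mean and (4.4) reduces to the usual variance of the mean. For a regression coefficient, `data` would be a list of (request, appropriation) pairs and `estimator` the fitting routine.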
A major part of the attractiveness of the jackknife in
this analysis stems from the fact that Miller has derived
results for the case where $\theta = f(\beta)$ and $\hat\beta$ is a least squares
estimate in a regression context [Miller, 1974]. In particular, suppose
$\theta = e^{\beta}$. Miller's results show that under suitable conditions, to
be specified later, an approximate t-statistic and confidence
interval can be constructed for $\theta$, using (4.3) and (4.4).


Although the properties of jackknifed estimators have not
yet been proven for the Huber "M" and other robust estimators,
their use in order to obtain an estimate of $\beta$ and to obtain
a useable estimate for $\sigma_{\beta}$ appears reasonable and is one of
the few attractive options for computing variance estimates
for "Huberized" estimators. Huber [1972, p. 1053] has pointed
out that each pseudo-value is a finite sample version of the
influence of a single observation on a complicated estimation
[Andrews et al., 1972]. He adds that "As a rule, jackknifing gives
a useable estimate for the variance of a robust estimate,"
and that this estimate of the variance can then be used to
produce an approximate confidence interval.

In the following section results of the application of
the jackknife both for least squares and Huber "M" estimates
will be presented, together with estimates of $\sigma_{\beta}$ and $\sigma_{\alpha}$.
Approximate confidence intervals will also be presented.

IV.3. Results

IV.3.1. Basic Results


Revised estimates of $\beta$ and $\alpha$ were computed by jackknifing.
This was done for (2.1) through (2.5)
both when using least squares and when using the Huber "M"
approach, with $c = 1.0$. In addition, estimates for the variance
of the estimates of $\beta$ and $\alpha$ were computed, as suggested by Miller in
[Miller, 1974]. All computations were performed both on the
procurement and the RDTE data.

Several important questions arise in the course of
analyzing the results. First, since jackknifing was originally
proposed as a technique for bias reduction, how do the jackknifed
estimates for the coefficients compare with the
"non-jackknifed" estimates, especially for $\beta$ in (2.3) through
(2.5)? Secondly, how do the variance estimates compare, both
between equations and between estimation techniques?

First, comparison of jackknifed versus non-jackknifed
estimates for $\beta$ and $\alpha$, using least squares and the Huber "M"
approaches, is contained in Tables IV.1 (procurement) and
IV.2 (RDTE).

As discussed earlier, least squares when applied to (2.3)
through (2.5) in logarithmic form yields biased estimates
for $\beta$. The jackknifed estimates for (2.3) do not appear to
differ very much from the unjackknifed estimates. This is
true for both least squares and Huber "M" estimates. On the
other hand, there is a marked difference between jackknifed
and unjackknifed estimates for $\beta$ in (2.4) and (2.5). While
jackknifed and unjackknifed estimates for $\beta$ differ, the
estimates for $\alpha$ appear to be equivalent.

The Huber "M" estimates demonstrate some interesting
characteristics. For RDTE, the jackknifed and unjackknifed
estimates for $\beta$ in (2.4) and (2.5) are nearly the same.
However, for the procurement data, the jackknifed estimates
differ considerably from the unjackknifed estimates, although
not as much as is true for the least squares estimates.
Reasons for these differences will be suggested later.

A final interesting point about Tables IV.1 and IV.2
deals with the estimate for $\beta$ obtained by using least squares
on (2.2) with the RDTE data. The unjackknifed $\beta$ was 1.09
while the jackknifed $\beta$ was .908. One result estimates
Congress on the average to grant 109% of the request while
the other says they grant 91% of the request. This point
will be pursued further after the estimates for $\sigma_{\beta}$ and $\sigma_{\alpha}$
are presented since these estimates are relevant to the
question of why there are some large differences between
jackknifed and unjackknifed results.

                         Procurement

           Least Squares                  Huber "M"
           $\beta$         $\alpha$       $\beta$         $\alpha$
(2.1)      .961            NA             .948            NA
           ([illegible])                  (.961)
(2.2)      .955            NA             .962            NA
           ([illegible])                  (.969)
(2.3)      [illegible]     NA             .950            NA
           ([illegible])                  (.963)
(2.4)      1.272           .994           .967            .995
           (1.47)          (.948)         ([illegible])   (.982)
(2.5)      [illegible]     .970           .765            1.021
           (1.242)         (.969)         (.997)          (.996)

TABLE IV.1
Jackknifed Least Squares and Huber "M" Estimates
(Top estimate is jackknifed estimate. Bottom estimate, in
parentheses, is unjackknifed estimate.)

                         RDTE

           Least Squares                  Huber "M"
           $\beta$         $\alpha$       $\beta$         $\alpha$
(2.1)      .982            NA             .973            NA
           (.982)                         ([illegible])
(2.2)      .908            NA             .984            NA
           (1.093)                        (.989)
(2.3)      [illegible]     NA             .987            NA
           ([illegible])                  (.989)
(2.4)      1.584           .893           1.036           .990
           (2.028)         (.903)         (1.061)         (.990)
(2.5)      1.377           .838           [illegible]     .990
           ([illegible])   (.852)         ([illegible])   (.994)

TABLE IV.2
Jackknifed Least Squares and Huber "M" Estimates
(Top estimate is jackknifed estimate. Bottom estimate, in
parentheses, is unjackknifed estimate.)

Tables IV.3 and IV.4 contain the jackknifed estimates
for $\beta$ and $\alpha$ together with estimates for $\sigma_{\beta}$ and $\sigma_{\alpha}$. These
estimates for the standard error were obtained by computing
the sum of the squared differences of the pseudo-values from
the jackknifed estimate, dividing by $T(T-1)$, and taking the
square root.

The results in Tables IV.3 and IV.4 are consistent with
the results in Tables IV.1 and IV.2 in the sense that both
highlight estimation problems for equations (2.4) and (2.5).
The estimates for $\sigma_{\beta}$ and $\sigma_{\alpha}$ in (2.4) and (2.5) are very
large, both for RDTE and procurement. Thus, the confidence
intervals will be very wide. For the procurement data the
estimates for $\sigma_{\beta}$ and $\sigma_{\alpha}$ are large both when least squares
and when Huber "M" estimation techniques are used, although
the Huber "M" standard errors are approximately one half the
size of those associated with least squares estimates. For
the RDTE data, the Huber "M" standard errors (in (2.4) and
(2.5)) are less than one-seventh the size of those of the
least squares estimates.

Also worthy of note is the size of the estimated standard
error for the least squares $\beta$ in (2.2) when the RDTE data was
used. Recall that the unjackknifed least squares estimate
for $\beta$ in (2.2) was 1.09 while the jackknifed estimate is .908.

                                                    Procurement

                                                    Least Squares                  Huber
                                                    $\beta$         $\alpha$       $\beta$         $\alpha$
(2.1) $y_t = \beta x_t + \epsilon_t$                .961            NA             .948            NA
                                                    (.016)                         ([illegible])
(2.2) $y_t = \beta x_t + \epsilon_t x_t$            .955            NA             .962            NA
                                                    (.023)                         ([illegible])
(2.3) $y_t = \beta x_t e^{u_t}$                     [illegible]     NA             .950            NA
                                                    (.020)                         ([illegible])
(2.4) $y_t = \beta x_t^{\alpha} e^{u_t}$            1.272           .994           .967            .995
                                                    ([illegible])   ([illegible])  ([illegible])   ([illegible])
(2.5) $y_t = \beta x_t^{\alpha} e^{u_t \ln x_t}$    [illegible]     .970           .765            1.02
                                                    (.794)          (.066)         (.337)          (.038)

TABLE IV.3
(Estimates of standard errors are in parentheses.)

                                                    RDTE

                                                    Least Squares                  Huber
                                                    $\beta$         $\alpha$       $\beta$         $\alpha$
(2.1) $y_t = \beta x_t + \epsilon_t$                .982            NA             .973            NA
                                                    (.013)                         ([illegible])
(2.2) $y_t = \beta x_t + \epsilon_t x_t$            .908            NA             .984            NA
                                                    (.095)                         (.007)
(2.3) $y_t = \beta x_t e^{u_t}$                     [illegible]     NA             .987            NA
                                                    (.034)                         (.007)
(2.4) $y_t = \beta x_t^{\alpha} e^{u_t}$            1.584           .893           1.036           .990
                                                    (1.238)         ([illegible])  ([illegible])   (.018)
(2.5) $y_t = \beta x_t^{\alpha} e^{u_t \ln x_t}$    1.377           .838           [illegible]     .990
                                                    (2.771)         (.188)         ([illegible])   (.024)

TABLE IV.4
(Estimates of standard errors are in parentheses.)

IV.3.2. Variations, or "Data Snooping"


In the results just reviewed two things caught our
attention. First, the differences between the jackknifed and
unjackknifed estimates for $\beta$ in (2.4) and (2.5) were noticeable.
This held for both least squares and Huber estimates when
procurement data were analyzed, and held only for least
squares estimates when the RDTE data were analyzed. Also,
least squares applied to (2.2) yielded different jackknifed
versus unjackknifed estimates for $\beta$ when RDTE data was used.
A second interesting result was the size of the estimated
standard errors for $\beta$, again in (2.4) and (2.5). The estimates
were large for both least squares and Huber "M" estimates for
the procurement data. Least squares estimates yielded large
estimated standard errors for $\beta$ in (2.2), (2.4) and (2.5)
for the RDTE data.

The reasons for the results for (2.4) and (2.5) may have 
been the same both for procurement and RDTE data. However, 
the fact that the Huber "M" estimates demonstrated the same 
overall characteristics as the least squares estimates when 
procurement data was used, and different overall characteris- 
tics when RDTE data was used, leads one to suspect otherwise. 
In order to further explore the question, the pseudo-values, 
scatterplots and the original data were examined. 

Examination of the pseudo-values for the procurement data
revealed one which stood out. It was the pseudo-value
corresponding to the item 1953 PAMN: the request was
$124 million, and appropriations were $113 million.
These numbers are an order of magnitude smaller than the rest
of the numbers in the data set, most of which are in the
$2 to $3 billion range. In Table IV.5, the pseudo-values
for this item both when least squares and when Huber "M"
techniques were used are compared to the jackknifed estimate.

                                Least Squares             Huber "M"
                                $\beta$       $\alpha$    $\beta$       $\alpha$
(2.4)  pseudo-value             -35.18        -.670       -16.48        2.80
       jackknifed estimate      1.272         .994        .967          .995
(2.5)  pseudo-value             13.02         5.80        -25.14        3.88
       jackknifed estimate      [illegible]   .97         .764          1.02

TABLE IV.5
Pseudo-Values for 1953 PAMN

The figures in Table IV.5 indicated that it might be
wise to compare $\hat\beta$ (and $\hat\alpha$) when the full sample was used to
$\hat\beta$ (and $\hat\alpha$) when 1953 PAMN is omitted from the sample. This
comparison is found in Table IV.6.

                                Least Squares             Huber "M"
                                $\beta$       $\alpha$    $\beta$       $\alpha$
(2.4)  Full data                1.47          .948        [illegible]   .982
       Data without 1953 PAMN   1.93          .912        1.337         .959
(2.5)  Full data                1.242         .969        .997          .996
       Data without 1953 PAMN   2.007         .908        1.328         .960

TABLE IV.6


Tables IV.5 and IV.6 indicate the significant impact of
one observation on the estimates. One of the puzzling facts
is that this impact is strong not only when least squares is
used but also when the supposedly robust Huber "M" procedure
is used. This raises some questions about the Huber "M"
procedure.

Examination of scatterplots of the data indicates that for
the full data set both least squares and the Huber "M"
procedure appear to be the victims of what might be called a
"straggler-effect." The "straggler-effect" was discussed
briefly in Chapter II. It is characterized by a scatterplot
similar to that in Figure IV.1.

FIGURE IV.1. The "Straggler Effect"

The estimated line with 1953 PAMN as part of the data is
line number (1). The estimated line with 1953 PAMN omitted
is line number (2).

Application of the Huber "M" technique does not solve
this problem. Recall that the technique acts like least
squares as long as $|\ln y_t - (\ln\beta + \alpha \ln x_t)| \le c$. If the
absolute value of the difference is greater than $c$ then a
particular observation, $t$, does not receive as much weight
as it would under least squares. This affords "protection"
against outliers such as in the situation in Figure IV.2.

FIGURE IV.2
Effect of an Outlier
(Broken line is least squares estimate,
unbroken line is Huber "M" estimate.)

If the "straggler" is in the Group 1 residuals (that is, it
is one of the observations for which
$|\ln y_t - (\ln\beta + \alpha \ln x_t)| \le c$)
then it has received as much (in fact more) weight under the
Huber "M" approach as it would have under least squares.

To summarize at this stage, the preceding discussion has
led to two useful insights. The first is that the Huber "M"
procedure is not particularly robust in at least one situation,
the case of the "straggler," if the "straggler" is not forced
into the Group 2 or Group 3 residuals.* The second is that
jackknifing and an examination of pseudo-values is useful for
spotting this situation. It should be added that examination
of the original data will alert the analyst to the possibility
that a "straggler" effect problem may exist; however, examination
of the pseudo-values is helpful in determining how
serious it is.
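How pseudo-values flag a "straggler" can be illustrated with a small log-log least squares fit of $\ln y_t = \ln\beta + \alpha \ln x_t$. The sketch below is an illustration, not the thesis computation: only the first data point (request $124 million, appropriation $113 million) comes from the text, the other points are hypothetical values placed exactly on $y = 0.95x$, and the function names are made up for this example.

```python
import math


def loglog_fit(xs, ys):
    """Ordinary least squares of ln y on ln x; returns (beta, alpha)."""
    u = [math.log(v) for v in xs]
    w = [math.log(v) for v in ys]
    n = len(u)
    u_bar = sum(u) / n
    w_bar = sum(w) / n
    sxx = sum((a - u_bar) ** 2 for a in u)
    sxy = sum((a - u_bar) * (b - w_bar) for a, b in zip(u, w))
    alpha = sxy / sxx
    beta = math.exp(w_bar - alpha * u_bar)
    return beta, alpha


def alpha_pseudo_values(xs, ys):
    """Jackknife pseudo-values for the exponent alpha."""
    T = len(xs)
    _, alpha_full = loglog_fit(xs, ys)
    pseudo = []
    for t in range(T):
        _, alpha_t = loglog_fit(xs[:t] + xs[t + 1:], ys[:t] + ys[t + 1:])
        pseudo.append(T * alpha_full - (T - 1) * alpha_t)
    return pseudo


# First point: the small item cited in the text. Remaining points: hypothetical.
requests = [124.0, 2000.0, 2200.0, 2400.0, 2600.0, 2800.0, 3000.0]
appropriations = [113.0] + [0.95 * r for r in requests[1:]]

pseudo = alpha_pseudo_values(requests, appropriations)
jackknifed_alpha = sum(pseudo) / len(pseudo)
# Index of the observation whose pseudo-value lies farthest from the average.
most_influential = max(range(len(pseudo)),
                       key=lambda t: abs(pseudo[t] - jackknifed_alpha))

if __name__ == "__main__":
    print(pseudo, most_influential)
```

Because the small item sits far from the others in $\ln x$, its pseudo-value stands apart from the rest even though its residual is modest, which is the pattern described above.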

Table IV.7 contains the revised estimates of the coefficients
and standard errors for equations (2.4) and (2.5) when
the "straggler", 1953 PAMN, is purged from the data. Of
course, elimination of this point, where both request and
appropriation are very small, will limit generalizations that
can be drawn from the data to situations where procurement
requests and appropriations are larger than $100 million.
In fact it limits generalizations to situations where requests
and appropriations are greater than $800 million. This is
not a very serious limitation, however, given the size of
procurement requests over the last fifteen years. Also
included in the table are the unjackknifed estimates. Table
IV.8 repeats the information in Tables IV.1 and IV.3; that
is, it presents estimates when 1953 PAMN is not removed from
the data in order to contrast these results with those in Table IV.7.





*Estimates based on functional forms for $\psi$ similar to
those discussed in Chapter III (p. III-6) may be more robust.

                                                    Procurement

                                                    Least Squares                                 Huber
                                                    Unjackknifed          Jackknifed              Unjackknifed          Jackknifed
                                                    $\beta$    $\alpha$   $\beta$      $\alpha$   $\beta$    $\alpha$   $\beta$      $\alpha$
(2.4) $y_t = \beta x_t^{\alpha} e^{u_t}$            1.93       .912       1.782        .912       1.337      .959       1.346        .955
                                                                          (.532)       (.002)                           (.269)       (.023)
(2.5) $y_t = \beta x_t^{\alpha} e^{u_t \ln x_t}$    2.007      .908       [illegible]  .908       1.328      .960       1.263        .963
                                                                          (.652)       (.002)                           (.262)       (.025)

TABLE IV.7
Results when 1953 PAMN is not used.
(Numbers in parentheses are estimated
standard errors)

                                                    Procurement

                                                    Least Squares                                 Huber
                                                    Unjackknifed          Jackknifed              Unjackknifed            Jackknifed
                                                    $\beta$    $\alpha$   $\beta$      $\alpha$   $\beta$      $\alpha$   $\beta$      $\alpha$
(2.4) $y_t = \beta x_t^{\alpha} e^{u_t}$            1.47       .948       1.272        .994       [illegible]  .982       .967         .995
                                                                          ([illegible]) ([illegible])                     ([illegible]) ([illegible])
(2.5) $y_t = \beta x_t^{\alpha} e^{u_t \ln x_t}$    1.242      .969       [illegible]  .970       .997         .996       .765         1.02
                                                                          (.794)       (.066)                             (.337)       (.038)

TABLE IV.8
Results with 1953 PAMN included in data.
(Numbers in parentheses are estimated
standard errors)

There are some interesting differences between the results
in Table IV.7 and Table IV.8. Without 1953 PAMN included, the
estimates for $\alpha$ in Table IV.7 indicate that the effect of the
size of the request on the percentage of the request that was
granted is more important than the results in Table IV.8
(where 1953 PAMN is included) indicate. For example, for
equation (2.4) the jackknifed least squares approach estimates
$\alpha$ to be .995, when 1953 PAMN is in the data set. An $\alpha$ of 1.0
says that the percentage of the request granted does not, on
the average, change proportionately as the size of the request
changes.* When 1953 PAMN is left out of the data, the estimated
$\alpha$ is .955 for the jackknifed least squares estimate. When
using the jackknifed Huber estimate, note that for equation
(2.5) the estimated $\alpha$ is 1.02 in Table IV.8, which implies
that the percentage of the request granted increases as the
request increases. In Table IV.7, the comparable estimate
for $\alpha$ is .908.

The differences between $\beta$ in Tables IV.7 and IV.8 are
considerable also. Note for the jackknifed Huber the $\beta$ is
1.263 in Table IV.7 compared to a $\beta$ of .765 in Table IV.8.

Insofar as the estimated standard errors are concerned,
in every case but one (the standard error for $\beta$ in equation
(2.4) when a jackknifed Huber estimate is used), the standard
errors are smaller when 1953 PAMN is not part of the data set.


*Recall that $\alpha$ may be interpreted as the elasticity of
the percentage of the request granted with respect to the
request.

The estimates for equations (2.4) and (2.5) for the
procurement data raised questions both when least squares
and when the Huber "M" approaches were used. However, the
estimates for equations (2.2), (2.4) and (2.5) for the RDTE
data raised questions only when the least squares method was
used. Again, inspection of pseudo-values for the RDTE data
and re-inspection of scatterplots helped to clarify these
questions. It appears that one data point, 1955 RDTE Navy,
for which the request was $61.2 million and appropriations
were $419.9 million, was seriously affecting the estimates.

The situation resembles that depicted in Figure IV.2.
As discussed in previous chapters, the effect of any one
observation on the estimated line, when least squares is used,
is directly proportional to the magnitude of the residual.
However, the Huber "M" technique does not give this observation
as much weight, and the result is as depicted in Figure IV.2.

Insofar as equation (2.2) is concerned, the estimated $\beta$,
using least squares, is seriously affected by this one
observation, 1955 Navy RDTE, because the estimator is the sample
mean of the $y_t/x_t$ ratios. The effect of extreme observations on the
sample mean as a location parameter estimator has been well
documented in Andrews, et al. [1972].
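The sensitivity of a mean of ratios to a single extreme ratio is easy to see numerically. In the sketch below only the 1955 Navy RDTE figures (request $61.2 million, appropriation $419.9 million) come from the text; the other nine ratios are hypothetical values near one, chosen only for illustration, and the median is shown purely as a contrast.

```python
from statistics import mean, median

# Hypothetical appropriation/request ratios for ordinary items.
other_ratios = [0.98, 1.00, 0.97, 0.99, 1.01, 0.96, 0.98, 1.00, 0.99]

# The item cited in the text: appropriation 419.9 against a request of 61.2.
navy_1955_ratio = 419.9 / 61.2

with_point = other_ratios + [navy_1955_ratio]

mean_without = mean(other_ratios)
mean_with = mean(with_point)
median_without = median(other_ratios)
median_with = median(with_point)

if __name__ == "__main__":
    print(mean_without, mean_with, median_without, median_with)
```

One ratio near seven is enough to move the mean of ten ratios well above one, while the median of the same values is unchanged.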

Table IV.9 contains the revised estimates after 1955 Navy 
RDTE was eliminated from the data set. Also included are the 
unjackknifed estimates and estimates of the standard errors. 
The least squares method was used to obtain these estimates. 
Table IV.10 contains the same information as Table IV.9, 


except that 1955 Navy RDTE was not left out. 
RDTE, Least Squares

                                      Unjackknifed             Jackknifed
                                      β            α           β              α
(2.2) y_t = βx_t + ε_t x_t            .999         NA          .999           NA
                                                               (.016)
(2.4) y_t = βx_t^α e^{u_t}            1.006        .945        .914           [illegible]
                                                               ([illegible])  (.02)
(2.5) y_t = βx_t^α e^{u_t ln x_t}     [illegible]  .828        [illegible]    1.03
                                                               (.156)         (.02)

TABLE IV.9
1955 Navy RDTE not part of data


RDTE, Least Squares

                                      Unjackknifed             Jackknifed
                                      β            α           β              α
(2.2) y_t = βx_t + ε_t x_t            1.093        NA          .908           NA
                                                               (.095)
(2.4) y_t = βx_t^α e^{u_t}            2.028        .903        1.584          .893
                                                               (1.238)        ([illegible])
(2.5) y_t = βx_t^α e^{u_t ln x_t}     [illegible]  .852        1.377          .838
                                                               ([illegible])  (.188)

TABLE IV.10
1955 Navy RDTE part of data







Tables IV.9 and IV.10 reveal some rather dramatic differences
between the results when 1955 Navy RDTE is part of the
data set (Table IV.10) and when it is not part of the data set
(Table IV.9). On the average all of the estimated standard
errors are smaller in Table IV.9 by about a factor of 10.

The coefficients also differ. For example, in
Table IV.10 both the estimates for α are less than one. In
Table IV.9 the jackknifed α estimates are close to 1.0. The
large discrepancy between the jackknifed and unjackknifed α
(and β) for equation (2.5) in Table IV.9 appears to be due
to a combination of "straggler effects" and outliers.
Examination of the pseudo-values revealed three more items
which appeared to be influential in causing the differences
observed.

Item          Request        Appropriation
1954 Army     475.0          345.0
1953 Navy     [illegible]    70.0
1954 Navy     58.6           [illegible]

(in millions)


Table IV.9(a) contains revised estimates for α, β and the
standard errors when the least squares method was applied
(unjackknifed and jackknifed). Notice that the unjackknifed
and jackknifed estimates are not as different from each other
as in Tables IV.9 and IV.10.
                                      Unjackknifed               Jackknifed
                                      β            α             β              α
(2.2) y_t = βx_t + ε_t x_t            1.009        NA            1.008          NA
                                                                 (.015)
(2.4) y_t = βx_t^α e^{u_t}            [illegible]  [illegible]   1.22           .971
                                                                 (.14)          (.015)
(2.5) y_t = βx_t^α e^{u_t ln x_t}     1.19         .976          [illegible]    .976
                                                                 ([illegible])  (.015)

TABLE IV.9(a)


When examining differences between the estimates for β in
(2.2) and (2.4)-(2.5), it should be recalled that (2.2) and
(2.4)-(2.5), when α ≠ 1, are different characterizations of
appropriations behavior. In (2.2) appropriations are
characterized as a constant percentage of the request while in
(2.4)-(2.5) that percentage changes as the size of the
request changes. In the next chapter, we will investigate
whether the estimates for β would be different among (2.1)
through (2.5) if the true model is (2.3).


Confidence regions: 

The recent work of Miller [1974] has shown that under
certain conditions, in large samples, when f(β̂) is jackknifed
(β̂ being a least squares estimate) a t-statistic confidence
interval can be constructed for f(β), using the jackknifed
estimate of the standard error of f(β̂).

Two of the assumptions that must be met in order to use
Miller's results are: (1) the fourth moment of ε_t (or u_t)
must be finite, and (2) f must have bounded second derivatives
in an open interval around β. These assumptions are satisfied
if the ε_t are assumed N(0, σ²) and if f(β) equals β or e^β,
as in the models used for this study. (3) The other assumption
deals with whether (1/T) XᵀX → Σ, a positive definite matrix, as
T → ∞. If convergence takes place, then Miller's results
hold, given the other two assumptions.
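
The mechanics of the jackknife interval can be sketched as follows. This is a minimal illustration and not the program used for the study: it assumes the usual pseudo-value definition (T times the full-sample estimate minus T-1 times the leave-one-out estimate), uses the mean-ratio estimator of (2.2) as the example estimator, and the four observations in the demonstration are hypothetical.

```python
import math


def ratio_estimate(xs, ys):
    """Least squares estimate of beta in y_t = beta*x_t + e_t*x_t (mean of y_t/x_t)."""
    return sum(y / x for x, y in zip(xs, ys)) / len(xs)


def jackknife(xs, ys, estimator, f=lambda b: b, z=2.0):
    """Return the jackknifed estimate of f(beta), its standard error,
    a (-z, +z) interval, and the pseudo-values."""
    T = len(xs)
    full = f(estimator(xs, ys))
    pseudo = []
    for i in range(T):
        xs_i = xs[:i] + xs[i + 1:]          # leave observation i out
        ys_i = ys[:i] + ys[i + 1:]
        pseudo.append(T * full - (T - 1) * f(estimator(xs_i, ys_i)))
    jack = sum(pseudo) / T
    variance = sum((p - jack) ** 2 for p in pseudo) / (T * (T - 1))
    se = math.sqrt(variance)
    return jack, se, (jack - z * se, jack + z * se), pseudo


if __name__ == "__main__":
    # Hypothetical requests and appropriations, for illustration only.
    xs = [100.0, 200.0, 300.0, 400.0]
    ys = [98.0, 202.0, 297.0, 404.0]
    estimate, se, interval, _ = jackknife(xs, ys, ratio_estimate)
    print(estimate, se, interval)
```

For the mean-ratio estimator the pseudo-values reduce to the individual ratios themselves, which is a convenient check on the arithmetic.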

Although Miller's results have not been extended to 
jackknifed Huber estimates, following the lead of Huber 
concerning jackknifed estimates [Huber,1972,p. 1053] we 
decided to compute confidence intervals for the Huber 
estimates that are analogous to those computed for least 
squares.* 

For both the least squares and Huber estimates the number
of degrees of freedom was large enough to permit the use of
the normal approximation for the t distribution. Confidence
intervals based on a standard normal interval of (-2.00, +2.00)
were constructed, yielding a nominal confidence level of .95.
Based on the considerations discussed in the previous section,
the straggler observation 1953 Navy PAMN was omitted from the
data when computing the confidence intervals for β and α in


*Huber [1973] suggests some possible estimates for the
variance of the estimated β in (2.1). The suggestion was
expanded upon for the purposes of deriving confidence
intervals for β in (2.1) through (2.5). Estimated intervals
and how they were derived are discussed in Appendix D of this
chapter. Apparently there is not yet any way of determining
whether the jackknifed Huber or the approach followed in
Appendix D produces the better interval without conducting
some sampling experiments.
equations (2.4) and (2.5) for the procurement data. Also,
the RDTE outliers 1955 Navy and 1954 Army and the stragglers
1953 Navy and 1954 Navy were omitted from the data when the
least squares estimates were used in setting up the confidence
intervals for the parameters in equations (2.2), (2.4) and
(2.5).

Tables IV.11 and IV.12 contain the confidence intervals
for β and α for procurement and RDTE data, both when least
squares was used and when the Huber "M" approach was used.

Among the most noticeable aspects of Tables IV.11 and
IV.12 is the fact that for the procurement data all the
Huber "M" intervals are smaller than or the same size
as the least squares intervals. For the RDTE data, the Huber
"M" intervals are smaller for (2.1) - (2.3). For (2.4) - (2.5)
the least squares intervals (with 4 points not included in
the data set) are slightly smaller than the Huber "M"
intervals (with all the data included). Since a theory of
confidence intervals for jackknifed Huber "M" estimates has
not yet been derived, one ought to be cautious in drawing
inferences from the differing sizes of the confidence
intervals for least squares and the Huber "M" estimates.
The sampling experiments reported in the next chapter shed
more light on this question. A second fact is that all of
the intervals for the coefficients in (2.1) through (2.3)
are considerably smaller than those for (2.4) and (2.5), and
those for (2.4) are slightly smaller than those for (2.5).
This point will again become of interest in Chapter V.
Procurement

                                      Least Squares                     Huber
                                      β                    α            β                   α
(2.1) y_t = βx_t + ε_t                [illegible]          NA           (.92, [illegible])  NA
(2.2) y_t = βx_t + ε_t x_t            ([illegible], 1.00)  NA           (.94, .99)          NA
(2.3) y_t = βx_t e^{u_t}              [illegible]          NA           (.94, .97)          NA
(2.4) y_t = βx_t^α e^{u_t}            [illegible]          [illegible]  [illegible]         [illegible]
(2.5) y_t = βx_t^α e^{u_t ln x_t}     [illegible]          [illegible]  [illegible]         [illegible]

TABLE IV.11


RDTE

                                      Least Squares                     Huber
                                      β                    α            β                   α
(2.1) y_t = βx_t + ε_t                [illegible]          NA           (.96, .99)          NA
(2.2) y_t = βx_t + ε_t x_t            (.98, 1.04)          NA           (.97, 1.00)         NA
(2.3) y_t = βx_t e^{u_t}              [illegible]          NA           [illegible]         NA
(2.4) y_t = βx_t^α e^{u_t}            (.94, 1.50)          (.94, 1.00)  (.75, 1.32)         (.95, [illegible])
(2.5) y_t = βx_t^α e^{u_t ln x_t}     (.92, 1.48)          (.95, 1.01)  (.67, 1.38)         [illegible]

TABLE IV.12




IV.4. Summary 


This chapter commenced with a discussion of the problem
of estimation of the variances and construction of confidence
intervals for Huber "M" and least squares estimators. The
jackknife was proposed as a technique for estimation of the
variances. Next, jackknifed estimates were computed using
the procurement and RDTE data. For the procurement data a
"straggler," 1953 PAMN, appeared to be having a disproportionate
effect on the jackknifed Huber "M" and least squares estimates.
For the RDTE data, four observations, especially 1955 Navy
RDTE, were having a disproportionate effect on the least
squares estimates. Confidence intervals were found for
parameters in (2.1) - (2.5) both on the basis of jackknifed
least squares and on the basis of jackknifed Huber "M"
estimates. In Chapter V some of the sampling properties
of Huber "M" and least squares estimates will be explored
when the error distribution is Cauchy. The coverage of the
jackknifed Huber "M" confidence intervals will also be
addressed. Finally, questions raised in Chapters II and, to
a certain extent, in Chapter III, concerning the possible
existence of special eras of Congressional activity and
special Congressional treatment of the Air Force in the
RDTE area will be discussed.
APPENDIX A


In Section IV.3, it was stated that a straggler observation
disturbed the jackknifed least squares and Huber "M"
estimates for the procurement data and at least one outlier
observation disturbed the jackknifed least squares estimate.
It was noted at the time these statements were made that 
scatterplots, pseudo-values and the raw data were of assist- 
ance in identifying stragglers and outliers. Normal plots 
provided another data analytic aid. 

As mentioned in the body of the chapter, it has been
reported that Tukey has suggested that under certain conditions
pseudo-values are approximately independent and normally
distributed [Miller,1964]. If this were true, then

$$\tilde{\beta}_{(t)} = \mu + \sigma\,\Phi^{-1}(Y_{(t)})$$

where $\tilde{\beta}_{(t)}$ is the t-th ordered pseudo-value,
μ and σ are the mean and standard deviation of the
pseudo-values, and $Y_{(t)}$ is the t-th order statistic from a
distribution which is U(0,1).

Under these circumstances, it is reasonable to consider the
regression of the t-th ordered pseudo-value versus
$E(\Phi^{-1}(Y_{(t)}))$, where T is the total sample size
[Wilk and Shapiro,1965]. However $E(\Phi^{-1}(Y_{(t)}))$ must
be determined by numerical methods [Teichrow,1962]. In its
place, what has been found to provide a good approximation is
the standard normal inverse of the expected value of the t-th
order statistic of a sample of size T from a uniform, U(0,1)
distribution, or $\Phi^{-1}(t/(T+1))$ [Blom,1958,pp. 70-71].

For jackknifed least squares and Huber "M" estimates,
so-called "probability plots" were made of $\tilde{\beta}_{(t)}$ versus
$\Phi^{-1}(t/(T+1))$. Although the plots did not (and were not meant to)
provide a means of formally testing the normality of the
pseudo-values, they were useful as a means of examining
characteristics of their distribution.
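
The coordinates of such a plot are simple to compute. The sketch below is illustrative only; the function name is ours, and it uses the standard normal inverse of t/(T+1) as the plotting position, as described above.

```python
from statistics import NormalDist


def probability_plot_points(pseudo_values):
    """Pair each ordered pseudo-value with the standard normal inverse of t/(T+1)."""
    T = len(pseudo_values)
    ordered = sorted(pseudo_values)
    standard_normal = NormalDist()  # mean 0, standard deviation 1
    positions = [standard_normal.inv_cdf(t / (T + 1)) for t in range(1, T + 1)]
    return list(zip(positions, ordered))


if __name__ == "__main__":
    # Hypothetical pseudo-values, for illustration only.
    for position, value in probability_plot_points([1.02, 0.97, 1.00, 0.99, 1.35]):
        print(f"{position:7.3f}  {value:5.2f}")
```

If the pseudo-values were normal the pairs would fall near a straight line; a single large pseudo-value shows up as an upturn at the right-hand end.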

Appendix B contains some of the typical probability plots. 


Most have a shape similar to that in Figure IV.A.1. 





[Figure: sketch of a typical probability plot, with axis label $\Phi^{-1}(t/(T+1))$]


FIGURE IV.A.1


The changes in slope associated with very high and very low
values for $\Phi^{-1}(t/(T+1))$ are more dramatic for some of the plots
than for others. Plots like that in Figure IV.A.1 are
frequently associated with distributions which, though
symmetric, have thicker, fatter tails than the normal, like
the double exponential and the Cauchy [Wilk and Gnanadesikan,
1968]. This is easily seen by noting that the increasing
slope with increasing values of $|\Phi^{-1}(t/(T+1))|$ can be associated
with the fact that for high values of $|\Phi^{-1}(t/(T+1))|$ the values
for $\tilde{\beta}_{(t)}$ are larger than would be expected if the
pseudo-values were normally distributed.
APPENDIX B


The plots contained in this appendix are a subset of
the plots which were made. However they do indicate general
characteristics of all the plots since they become more
linear when (1) straggler and outlier observations are
omitted (e.g., compare pages B-2 and B-3) and (2) the Huber
"M" technique, as opposed to least squares, is employed
(e.g., compare pages B-2 and B-4; also compare pages B-5
and B-6).

This appendix contains normal plots for the pseudo-values
of jackknifed least squares and jackknifed Huber "M"
estimates. The rationale behind these plots is discussed in
Appendix A of this chapter. For all plots $\Phi^{-1}(t/(T+1))$ is
plotted as the horizontal component while $\tilde{\beta}_{(t)}$, the t-th
ordered pseudo-value, is plotted as the vertical component.


The following is a listing which identifies the plots.


                                        Least Squares    Huber M
                                        Page             Page

(2.2) y_t = βx_t + ε_t x_t

    RDTE                                B-2              B-4

    RDTE (59 observations)              B-3

([equation number illegible]) y_t = βx_t e^{[exponent illegible]}

    Procurement                         B-5              B-6
APPENDIX C


In this chapter, one of the models that was discussed
was the following:

$$y_t = \beta x_t e^{u_t}$$

where y_t is the appropriations in year t
      x_t is the request in year t
      u_t is a random variable with expectation of 0 and
          variance of one

One method of estimation of β is to take logarithms of
both sides of the equation to yield:

$$\ln y_t = \ln\beta + \ln x_t + u_t$$

The least squares estimator for ln β is

$$\widehat{\ln\beta} = \frac{1}{T}\sum_{t=1}^{T} \ln\frac{y_t}{x_t}$$

If the estimator for β is taken to be $\hat{\beta} = e^{\widehat{\ln\beta}}$, then the
expected value of $\hat{\beta}$ is given by

$$E(\hat{\beta}) = E\Big[\prod_{t=1}^{T}\Big(\frac{y_t}{x_t}\Big)^{1/T}\Big]
 = E\Big[\prod_{t=1}^{T}\Big(\frac{\beta x_t e^{u_t}}{x_t}\Big)^{1/T}\Big]
 = \beta\,E\Big[\prod_{t=1}^{T} e^{u_t/T}\Big]
 = \beta\,\big[E(e^{u_t/T})\big]^{T}$$

if the u_t's are assumed to be independent and identically
distributed. Thus, in general the estimator is biased since
$E(e^{u_t/T})$ is not equal to 1.0 for any nondegenerate distribution
of u_t for which E(u_t) = 0, for by Jensen's inequality
[Feller, II, 1958, p. 152] our assumptions imply

$$E(e^{u_t/T}) > e^{E(u_t)/T} = 1$$

In fact, by the Taylor series expansion of the exponential,
$E(e^{u_t/T}) > 1 +$ [remainder illegible].

Thus, $E(\hat{\beta}) > \beta$
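
The size of this bias can be checked by simulation. The following sketch is our own illustration, not part of the original study. It assumes normally distributed u_t with variance one (the appendix only requires mean zero and i.i.d. errors), a deliberately small sample size T = 2 so the bias is visible, and a hypothetical request of 100 for every observation. Under the normal assumption, β[E(e^{u_t/T})]^T equals β·exp(1/(2T)).

```python
import math
import random


def beta_hat(xs, ys):
    """Exponential of the least squares estimate of ln(beta) in ln y = ln beta + ln x + u."""
    mean_log_ratio = sum(math.log(y / x) for x, y in zip(xs, ys)) / len(xs)
    return math.exp(mean_log_ratio)


def average_estimate(beta=1.0, T=2, reps=100_000, seed=0):
    """Average of beta_hat over many artificial samples with N(0,1) errors (assumption)."""
    rng = random.Random(seed)
    xs = [100.0] * T  # hypothetical requests; they cancel in the ratio y/x
    total = 0.0
    for _ in range(reps):
        ys = [beta * x * math.exp(rng.gauss(0.0, 1.0)) for x in xs]
        total += beta_hat(xs, ys)
    return total / reps


if __name__ == "__main__":
    T = 2
    print("simulated E(beta_hat):", average_estimate(T=T))
    print("beta * exp(1/(2T)):   ", math.exp(1.0 / (2 * T)))
```

The bias shrinks as T grows, since exp(1/(2T)) approaches one, which is consistent with the estimator being biased but not badly so in samples of the size used in this study, provided the expectation exists.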
APPENDIX D


In the body of this chapter, confidence intervals were
developed for the coefficients in (2.1) through (2.5) by
jackknifing Huber "M" estimates. Huber [1973, pp. 814-815
and p. 818] has suggested using the following as an estimate
of the variance of $\hat{\beta}$ in an equation like (2.1), $y_t = \beta x_t + \varepsilon_t$:

[Equation illegible in source. The legible fragments involve T,
N(T_1), N(T_2), the residuals $y_t - \hat{\beta} x_t$ and the scale estimate s.]

where T is the total sample size, s is the estimated scale
parameter, $\hat{\beta}$ is the estimated coefficient, and N(T_i) is the number
of items in the Group i residuals. This estimate is based
on the theoretical value for the variance of the estimator
derived by Huber [1973, pp. 812-813].







Confidence intervals were constructed for (2.1) through
(2.5) based on Huber's suggestion. For those cases where
$\beta = e^{\ln\beta}$, that is equations (2.3) and (2.5), confidence
intervals were constructed for ln β and the upper and lower
limits of these intervals were then transformed into upper
and lower limits for β. In other words, if we let the upper
limit of the confidence interval for ln β be designated as
$(\ln\beta)^{+}$ then the upper limit for β, which is designated as
$\beta^{+}$, was computed by taking $\beta^{+} = e^{(\ln\beta)^{+}}$.


The estimated confidence intervals are contained in
Tables D.1 and D.2.
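
The transformation of the limits can be sketched as follows. This is an illustration only: the function name is ours, the multiplier of 2.0 is borrowed from the intervals in the body of the chapter and is an assumption here, and the inputs in the demonstration are hypothetical.

```python
import math


def interval_for_beta(log_beta_hat, se_log_beta, z=2.0):
    """Build an interval for ln(beta), then exponentiate both limits."""
    lower_log = log_beta_hat - z * se_log_beta
    upper_log = log_beta_hat + z * se_log_beta
    return math.exp(lower_log), math.exp(upper_log)


if __name__ == "__main__":
    # Hypothetical estimate of ln(beta) and its standard error.
    print(interval_for_beta(math.log(0.99), 0.01))
```

Because the exponential is monotone, the coverage of the interval for β is the same as that of the interval for ln β, although the transformed interval is no longer symmetric about the point estimate.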


Procurement

                                      β                       α
(2.1) y_t = βx_t + ε_t                [illegible]             NA
(2.2) y_t = βx_t + ε_t x_t            [illegible]             NA
(2.3) y_t = βx_t e^{u_t}              [illegible]             NA
(2.4) y_t = βx_t^α e^{u_t}            [illegible]             [illegible]
                                      (.90, 1.99)*            [illegible]
(2.5) y_t = βx_t^α e^{u_t ln x_t}     [illegible]             [illegible]
                                      (.90, [illegible])*     (.91, [illegible])

TABLE D.1


*Estimates computed without 1953 PAMN in data set.







RDTE

                                      β                       α
(2.1) y_t = βx_t + ε_t                (.96, .99)              NA
(2.2) y_t = βx_t + ε_t x_t            (.97, 1.00)             NA
(2.3) y_t = βx_t e^{u_t}              (.97, 1.00)             NA
(2.4) y_t = βx_t^α e^{u_t}            [illegible]             [illegible]
(2.5) y_t = βx_t^α e^{u_t ln x_t}     (.89, 1.18)             (.97, [illegible])

TABLE D.2


The estimates in Tables D.1 and D.2 differ somewhat from
those in Tables IV.11 and IV.12. This is especially true of
the estimates for (2.4) and (2.5). However, the intervals
are very similar for (2.1) through (2.3). The question of
which approach, the jackknife or Huber's suggestion, produces a
higher coverage rate and, on the average, smaller intervals will
be addressed in the next chapter where artificial data are
generated and confidence intervals constructed using both
approaches.
CHAPTER V


In the previous two chapters least squares and Huber "M"
estimates were computed and compared using five different
equations which were designed to represent general tendencies
on the part of the Congress in the budgeting of defense. In
Chapter III point estimates were derived, while in Chapter IV
confidence intervals were constructed. At this point in the
analysis, there are some important unresolved questions.
Which of the equations best represents the Congressional
tendencies we are trying to characterize? How do Huber "M"
and least squares estimates compare when the error distribution
is known to be a long-tailed one, such as, for example, the
Cauchy? How sensitive are least squares and Huber "M" to
specification error? For example, suppose the "true" model
were given by equation (2.2) but equation (2.4) was used for
estimation purposes. Since there is as yet no formal theory
which supports the jackknifing of Huber "M" estimates, how do
jackknifed estimates, both point and interval estimates,
behave when computed for artificially generated data? These
are questions which will be addressed in this chapter.





This chapter is divided into four basic sections. The 
first section discusses the background and rationale of Monte 
Carlo studies which were designed to examine questions such 
as the ones just raised. In the next section, the specifics 
of the Monte Carlo techniques used are presented. Results 
of the studies are presented in the third section. The final 
section re-examines some questions raised in previous chapters 
concerning "eras" in Congressional behavior, and similarities 
and differences between action on the three services' requests.
These questions are discussed in the light of results from 
the Monte Carlo studies and the results of analysis in 


previous chapters. 
V.1. Background


V.1.1. Small Sample Properties


Asymptotic properties for Huber "M" estimates have been
derived by Huber in [1973]. To date, however, there have apparently
been no formal studies of the small sample properties of the
estimates. Results of certain Monte Carlo studies were
reported in [Huber,1973] and [Andrews,1973]. Since the
estimation procedure was applied to five different equations
using RDTE and Procurement data, we have sought to investigate
sampling properties of the estimates for relatively small
samples of size 63 (the same as that for the RDTE data) for
each of the five equations. Also, since the RDTE data contained
some unusual or extreme observations, representing Congressional
appropriations much larger (or smaller) than usual, the
artificial data generated was based on samples from a
long-tailed error distribution, the Cauchy.

Among the results sought from the Monte Carlo studies 
are answers to questions concerning the distribution of the 
estimates, and the expectation and variance of the estimates 


where these concepts are meaningful. 


V.1.2. Specification Error


A major difficulty which arises when trying to estimate
coefficients in regression equations is the problem of
specification error. Among the problems generally referred
to as specification error are incorrect functional form and
lack of independence among the error terms.

The problem of an incorrect functional form can best be
explained using an example. Suppose tentatively that the
"true" model is (2.1), that is, $y_t = \beta x_t + \varepsilon_t$, but equation
(2.4), $y_t = \beta x_t^{\alpha} e^{u_t}$, is chosen by the analyst to depict the
appropriations process. Can one count on the estimates for
α and β for (2.4) to generally be near the values of 1.0
(for α) and the true β (for β)? On the other hand, what if
the "true" model were (2.4)? What would the estimates for β
in (2.1) look like in this case? This obviously depends,
at least partially, upon the "true" value for α. Although
some analytical work has been devoted to examining the problem
of incorrect functional form, especially in a least squares
context [Box and Cox,1964, Zarembka,1968], we chose to address this
problem using Monte Carlo methods, by generating data using
one equation, (2.3) for example, and estimating coefficients
using the five different regression models. The behavior of
least squares and, more importantly, Huber "M" estimates was
compared by this process. Results of these studies are
reported in Section 3 of this chapter.

Another specification problem arises if the error terms
in the equations are not independent. More specifically,
suppose the ε_t in (2.1) are not independent but instead are
related by first order autocorrelation. Huber has said that
lack of independence of the error terms causes difficulties
for Huber "M" estimates, at least for deriving asymptotic
results [Huber,1973,p. 804]. Tests for autocorrelation in
a least squares context and correction to the functional form
based on the results of these tests have been used extensively
by econometricians [Kane,1968,pp. 364-373, Durbin,1951, Theil,1961].
However, often these tests are inconclusive, and such tests
and adjustments are not well understood when Huber "M"
estimates are computed. Consequently, in order to examine
this problem and to ascertain its potential impact in the
specific situation being studied here (that is, Congressional
action on defense budgets), artificial data resembling the
actual RDTE data was generated using error terms related
through first-order autocorrelation. The effect on the
distribution of the Huber "M" estimates was assessed; results
will also be discussed in Section 3.
V.1.3. Confidence Interval Estimates


In Chapter IV, confidence intervals were estimated for
coefficients in equations (2.1) through (2.5) using
the jackknife. Also, confidence intervals
were constructed for the Huber "M" estimates using a finite
sample approximation to the asymptotic variance of the Huber
"M" estimate which was suggested by Huber [See Appendix D,
Chapter IV]. Several questions were left unresolved. First,
since no theory exists for jackknifing Huber "M" estimates,
was the procedure followed in Chapter IV justified? Even if
the procedure were justified, which method provides the
better estimate in terms of coverage of the true value of
the parameter and size, the estimate suggested by Huber or
the jackknifed estimate?

Again, these questions were investigated using Monte
Carlo techniques. Artificial data were constructed, using
Cauchy errors. Because the data were constructed artificially,
the true parameter value was known. Next, 95% confidence
intervals were constructed for a number of samples of size
63, using the jackknife and using Huber's suggested estimate.
The coverage and size of these confidence intervals were
compared. The results follow in Section 3 of this chapter.


V.2. Techniques


The artificial data set which was analysed was constructed
in the following way. The 63 RDTE observations provided the
values for the x variable in equation (2.3).
The values for u_t were generated by sampling from a Cauchy
distribution. A value for β equal to .989 (the value
estimated for the real or non-artificial data using the
Huber "M" technique) was used as the "true" value for the
parameter. Using these three components (an assumed β, the
x_t values from the original data, the Cauchy distributed u_t
values) sample y_t values were generated.

The Cauchy random errors were assumed to have a value
of zero for their location parameter. The value for the
scale parameter which was used was the median absolute
residual for the original RDTE data when equation (2.3)
(in logarithmic form) was analyzed using the Huber "M"
approach. In those cases when the "true" model was assumed
to be (2.4) the values used for β and α were 1.061 and .99
respectively, the estimated values for the original data
when the Huber "M" technique was applied to (2.4). The
scale parameter for the distribution of u_t was set equal to
the median absolute residual for (2.4) when the Huber "M"
approach was applied.

The Cauchy errors were generated using a uniform random 
number generator devised and tested by Lewis and Learmonth 
[1973] and then applying an inverse transformation. 
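
The inverse transformation step can be sketched as follows. This is only an illustration under stated assumptions: Python's built-in generator stands in for the Lewis and Learmonth generator, which is not reproduced here, and the requests and the scale value in the demonstration are hypothetical. The function names are ours.

```python
import math
import random


def cauchy_error(rng, scale, location=0.0):
    """Inverse transformation: pass a U(0,1) draw through the Cauchy quantile function."""
    u = rng.random()
    return location + scale * math.tan(math.pi * (u - 0.5))


def safe_exp(value):
    """exp() that returns infinity instead of raising on very large Cauchy draws."""
    try:
        return math.exp(value)
    except OverflowError:
        return math.inf


def generate_sample(requests, beta, scale, rng):
    """Artificial appropriations from y_t = beta * x_t * exp(u_t), u_t Cauchy (model 2.3)."""
    return [beta * x * safe_exp(cauchy_error(rng, scale)) for x in requests]


if __name__ == "__main__":
    rng = random.Random(1973)
    requests = [475.0, 58.6, 61.2]  # illustrative request values, $ millions
    print(generate_sample(requests, beta=0.989, scale=0.03, rng=rng))
```

The overflow guard is needed only because a Cauchy draw is occasionally enormous, which is the same feature that makes the least squares results of Section V.3 so erratic.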

In some cases it was desired to introduce autocorrelated
errors. In order to do this, a basic Cauchy error, V_1, was
generated for observation one. However, for observation two
the error was computed to be equal to U_2, where

$$U_2 = \tfrac{1}{2}(V_1 + V_2)$$

and V_2 is a Cauchy error independent from V_1.

In general, $U_t = \tfrac{1}{2}(U_{t-1} + V_t)$. This produced a
sequence of correlated Cauchy errors {U_1, U_2, ..., U_63} with
the same scale as the independent Cauchy errors,
{V_1, V_2, ..., V_63}.*
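
A sketch of this construction follows. It assumes the recursion takes each error to be the average of the previous error and a fresh independent Cauchy draw, with the first error equal to the first draw; the function name and the scale value are hypothetical, and Python's built-in generator is used in place of the one employed in the study.

```python
import math
import random


def autocorrelated_cauchy(n, scale, rng):
    """Errors U_1..U_n with U_1 = V_1 and U_t = (U_{t-1} + V_t) / 2, V_t i.i.d. Cauchy."""
    errors = []
    previous = None
    for _ in range(n):
        v = scale * math.tan(math.pi * (rng.random() - 0.5))  # independent Cauchy draw
        u = v if previous is None else 0.5 * (previous + v)
        errors.append(u)
        previous = u
    return errors


if __name__ == "__main__":
    print(autocorrelated_cauchy(63, scale=0.03, rng=random.Random(7))[:5])
```

Averaging keeps each U_t Cauchy with the original scale, so about half of the errors should still fall within one scale unit of zero, while successive errors share half of their value.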


V.3. Results


V.3.1. Huber "M" and Least Squares Estimates with
Artificial Data: Monte Carlo Sampling Properties


In the previous section we described the manner
in which artificial data were generated. A total
of 1000 samples or histories of size 63 were generated. For
these samples, the coefficients in (2.1) through (2.5) were
estimated by least squares and the Huber "M" technique. The
empirical frequency distributions were examined and compared.
Table V.1 gives the sample mean values for the coefficient(s)
estimated for (2.1) through (2.5) using least squares and
the Huber "M". Below these mean values are the sample standard
deviations.


*This is true because the average of two independent
identically distributed Cauchy random variables is a Cauchy
random variable with the same distribution (i.e. scale and
location parameters).


                                      Huber "M" (c=1.0)        Least Squares
                                      β          α             β               α
(2.1) y_t = βx_t + ε_t                .990       NA            [illegible]     NA
                                      (.009)
(2.2) y_t = βx_t + ε_t x_t            .990       NA            [illegible]     NA
                                      (.008)
(2.3) y_t = βx_t e^{u_t}              .989       NA            [illegible]     NA
                                      (.008)
(2.4) y_t = βx_t^α e^{u_t}            .992       1.000         3626.           .997
                                      (.068)     (.010)        (6x10^5)        (.247)
(2.5) y_t = βx_t^α e^{u_t ln x_t}     .992       1.000         .998            5.415
                                      (.068)     (.010)        ([illegible])   ([illegible])

TABLE V.1


For (2.1) and (2.2), using least squares, the distribution of
estimators was very skewed with approximately 3/4 of the observations
lying between .92 and 1.05 and the remainder scattered among
values greater than 1.05, some values being extremely large.
The few large values made the computation of a mean seem
meaningless. Actually, when least squares is applied, the
distribution of the estimated β has no finite expected value.

Recall from Chapter IV, Appendix C, that for the expected value
of the estimate for β in (2.3)

$$E(\hat{\beta}) = \beta\,\big[E(e^{u_t/T})\big]^{T}$$

However, if u_t is Cauchy distributed, then $E(e^{u_t})$ does not
exist. This explains the extremely erratic sample mean and
variances for (2.1) and (2.2) and the unusual values found
for (2.3) through (2.5).
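
A short worked step, added here as a sketch, shows why the expectation fails to exist. It assumes u_t has the Cauchy density with location zero and scale s (the symbol s is introduced only for this illustration):

```latex
E\!\left(e^{u_t}\right)
  = \int_{-\infty}^{\infty} \frac{e^{u}}{\pi s \left[ 1 + (u/s)^{2} \right]} \, du ,
\qquad
\frac{e^{u}}{\pi s \left[ 1 + (u/s)^{2} \right]} \longrightarrow \infty
\quad \text{as } u \to \infty .
```

Since the integrand itself grows without bound in the upper tail, the integral diverges. The same argument applies to $e^{u_t/T}$ for any finite T, so dividing the exponent by the sample size does not rescue the expectation.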

On the other hand, it is not clear whether the expectation
of the Huber "M" estimator exists or not. The sample means
and variances derived were reasonable and in fact for (2.1)
through (2.5) the sample means came very close to equalling
the true parameter value, .989. It should be noted that the
mean values for β and α in (2.4) and (2.5) were .992 and
1.000. This is interesting since, in using (2.3) to generate
the data, α was implicitly set equal to 1.0. However note
that the sample standard deviations for β in (2.4) and
(2.5) were seven to eight times larger than the sample standard
deviations for β in (2.1) through (2.3). These facts indicate
that if data are generated by (2.3), on the average the
estimated results will be the same no matter which equation
is assumed to be the correct model for estimation purposes.
However, it is not altogether unlikely on any one occasion
that the estimated values may differ considerably, with the
estimate for β using (2.2) being near .998 and that for
(2.4) near 1.06. These two values are only one sample
standard deviation unit away from the sample means of the
respective equations.

For each equation (2.1) through (2.5) normal plots of
the 1000 estimates were constructed. Once again, the
rationale behind plotting the ordered estimates in this way
is that if the distribution of the estimates were normal,
then the plots would be approximately linear. The plots
for the Huber "M" estimates, found in Appendix A of this
chapter, appear to be almost perfect straight lines. Sample
skewness and kurtosis were also computed for each of the
sets of 1000 estimates and goodness-of-fit tests run on
each set [See Appendix A].

The least squares sample mean and variances were for
most cases useless as indicators of the location and scale
of the distribution of least squares estimates of β.
Alternative indicators of the location and scale of the
distribution are the median and semi-interquartile range.
The results are contained in Table V.2.
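
The kind of comparison summarized in Tables V.1 and V.2 can be sketched on a small scale. The code below is an illustration and not the program used in the study. It treats only model (2.3) in logarithmic form, where ln(y_t/x_t) = ln β + u_t, so the estimates reduce to location estimates. It assumes a hypothetical Cauchy scale of 0.03, uses 300 rather than 1000 samples, fixes the Huber scale at the median absolute deviation, and works on the ln β scale so that a wild least squares value cannot overflow.

```python
import math
import random
from statistics import median, quantiles

TRUE_BETA = 0.989   # "true" value quoted in the text
SCALE = 0.03        # hypothetical Cauchy scale; the study's value is not reproduced here
T, REPS = 63, 300   # sample size from the text; fewer replications than the 1000 used


def huber_location(z, c=1.0, iterations=50):
    """Huber "M" location estimate by iterative reweighting, scale fixed at the MAD."""
    mu = median(z)
    s = median([abs(v - mu) for v in z]) or 1.0
    for _ in range(iterations):
        weights = [1.0 if abs(v - mu) <= c * s else c * s / abs(v - mu) for v in z]
        mu = sum(w * v for w, v in zip(weights, z)) / sum(weights)
    return mu


def semi_interquartile_range(values):
    """Half the distance between the first and third quartiles."""
    q1, _, q3 = quantiles(values, n=4)
    return (q3 - q1) / 2.0


def run(seed=0):
    """Return least squares and Huber estimates of ln(beta) for REPS artificial samples."""
    rng = random.Random(seed)
    least_squares, huber = [], []
    for _ in range(REPS):
        z = [math.log(TRUE_BETA) + SCALE * math.tan(math.pi * (rng.random() - 0.5))
             for _ in range(T)]
        least_squares.append(sum(z) / T)
        huber.append(huber_location(z))
    return least_squares, huber


if __name__ == "__main__":
    ls, hub = run()
    print("median beta, least squares:", math.exp(median(ls)))
    print("median beta, Huber:        ", math.exp(median(hub)))
    print("semi-IQR of ln(beta), least squares:", semi_interquartile_range(ls))
    print("semi-IQR of ln(beta), Huber:        ", semi_interquartile_range(hub))
```

The median and semi-interquartile range are used for both sets of estimates here because, as noted above, the mean and variance of the least squares estimates are not meaningful under Cauchy errors.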

If one compares the least squares results in Table V.2
with the Huber "M" results in Table V.1 it is seen that the
central tendency (measured by the sample mean) of the Huber
"M" results, for (2.1) through (2.5), is slightly closer to
the true parameter value of .989 than the central tendency
(measured by the sample median) of the least squares results.
Least Squares

          β               α
(2.1)     1.008           NA
          ([illegible])
(2.2)     1.00            NA
          (.866)
(2.3)     .988            NA
          (.082)
(2.4)     .999            .998
          ([illegible])   ([illegible])
(2.5)     .999            .994
          (.022)          ([illegible])

TABLE V.2


Also the dispersion of each of the Huber "M" results (measured
by the sample standard deviation) is smaller than the
dispersion of the least squares results (measured by the
semi-interquartile range).


V.3.2. The Huber "M" and Specification Error 


The remainder of this section will be devoted to further
discussion of Huber "M" estimates and the properties which
they demonstrate in the Monte Carlo studies. First the
question of specification error will be explored; then the
coverage of confidence intervals constructed by jackknifing
Huber "M" estimates will be compared to the coverage of
intervals constructed using Huber's suggested estimate for
the standard error of an estimated coefficient.
(i) Incorrect Functional Form

The first specification error problem is concerned with
what happens to the estimates for β if the data are generated
using (2.3) and coefficients are estimated using one of the
other equations. Table V.1 contains some very interesting
results.

On the average, it appeared to make very little
difference which equation was used for estimation purposes.
There was almost no perceptible difference between the results
obtained when using equations (2.1), (2.2) or (2.3). These
estimates, on the average, were equal to the true value of
the parameter which generated the data. The distributions
of estimates when using (2.4) or (2.5) were somewhat more
widely dispersed than those for (2.1) through (2.3). (The
sample standard deviation for β using (2.3) was .008 while
the sample standard deviation for β using (2.4) was .068.)
Also, the estimates for β using (2.4) - (2.5), on the
average, were slightly greater than the true β value of .989.
In a way this is a tautology. If estimates are equal, then
there is little to choose from, for the models (2.3) and (2.4)
become equivalent.

In order to examine this question from another viewpoint,
data were generated using (2.4). Table V.3 contains sample
means and standard deviations of the distributions of 1000
estimates of β and α (where appropriate) using (2.1) through
(2.5) when the data were generated using $y_t = 1.061\,x_t^{.99}\,e^{u_t}$.
Data generated using y_t = 1.061 x_t^{.99} e^{u_t}

          β          α
(2.1)     .983       NA
          (.009)
(2.2)     .990       NA
          (.008)
(2.3)     .990       NA
          (.008)
(2.4)     1.066      .990
          (.069)     (.009)
(2.5)     1.066      .990
          (.071)     (.009)

TABLE V.3

Means and standard deviations of
1000 estimates

Notice that on the average the estimates using (2.4) and
(2.5) were nearly equal to the true values for β and α of 1.061
and .990, respectively. The average estimates for β using
(2.1) through (2.3) were not equal to the average estimates
using (2.4) and (2.5), which is to be expected since if α ≠ 1.0
the interpretation of β in (2.4) through (2.5) is different
from that in (2.1) through (2.3). On the other hand the
estimates for β using (2.2) and (2.3) were on the average
equal to the values obtained from the real data when using
the Huber "M". The average estimate for β using (2.1) was
close to, but not equal to, the value obtained when the Huber
"M" was applied to the real data. (That value was .977.)

Another aspect of the results is that although the true model,
the one which generated the data, was (2.4), the sample
standard error for the distribution of β when (2.4) was
used for estimation purposes was larger than that for β
when (2.1) through (2.3) were used for estimation purposes.
This highlights the fact that if α ≠ 1 in (2.4)-(2.5), the
parameter β has a different interpretation from the
parameter β in (2.1)-(2.3).

The results of the Monte Carlo studies of specification
error when the data were generated using (2.3) or (2.4), and
estimated using (2.1)-(2.5), help to clarify some of the
issues involved in model selection. Assuming one limits
his characterization of Congressional appropriations behavior
to models (2.1)-(2.5), then if the estimates for β in
(2.1)-(2.5) are the same, with α = 1.0 in (2.4)-(2.5), then
one is left with little choice but to characterize the usual
result of Congressional appropriations action as appropriations
being a fixed percentage of requests. However if the results
for (2.1)-(2.3) differ from those for (2.4)-(2.5), with
α ≠ 1.0 for the latter models, there are two possible
explanations. The data could have been generated by a fixed
percentage model and the high dispersion of estimates (using
(2.4)-(2.5)) for β and α resulted in values for α and β which
did not characterize appropriations as being a fixed
percentage of requests as estimated by (2.1)-(2.3). On the
other hand, the Monte Carlo studies appear to indicate that
if the data were generated by a variable percentage model
like (2.4), then the average estimates for (2.4)-(2.5) will
differ from the average estimate for (2.1)-(2.3). The
dispersion of the estimates for (2.1)-(2.3) may be of
assistance in this case. Further studies, using different
scale factors for the errors, indicate that in many cases when
the true model was (2.3) the estimates for β using (2.1)-(2.3)
were much less dispersed than those for β using (2.4)-(2.5),
but when the true model was (2.4), the estimates for β using
(2.1)-(2.3) were less dispersed than those for β using
(2.4)-(2.5) by a smaller margin. The problem of
specification error is one of the areas requiring more
research.
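
The specification-error experiment described above can be sketched in a few lines. This is only an illustration under stated assumptions, not the procedure actually used in the study: the requests `x` are hypothetical (drawn log-uniformly between 200 and 8000), the Cauchy scale of 0.01 is arbitrary, the Huber "M" estimate is computed by iteratively reweighted least squares with a MAD scale and tuning constant 1.345, and (2.3) and (2.4) are taken in their logarithmic forms, log(y/x) = log β + u and log y = log β + α log x + u. Data are generated in logs so that extreme Cauchy draws cannot overflow.

```python
import numpy as np

def huber_irls(X, y, c=1.345, iters=30):
    """Huber M-estimate of regression coefficients by reweighted least squares."""
    coef = np.linalg.lstsq(X, y, rcond=None)[0]          # least squares start
    for _ in range(iters):
        r = y - X @ coef
        # Robust scale: median absolute deviation, rescaled for the normal case.
        s = max(np.median(np.abs(r - np.median(r))) / 0.6745, 1e-12)
        a = np.abs(r) / s
        # Huber weights: 1 inside [-c, c], c/|r| outside.
        w = np.where(a <= c, 1.0, c / np.maximum(a, 1e-12))
        sw = np.sqrt(w)
        coef = np.linalg.lstsq(X * sw[:, None], y * sw, rcond=None)[0]
    return coef

def specification_experiment(n_rep=300, n=63, beta=0.989, scale=0.01, seed=0):
    rng = np.random.default_rng(seed)
    # Hypothetical requests; these are NOT the RDTE data of the study.
    log_x = rng.uniform(np.log(200.0), np.log(8000.0), size=n)
    X23 = np.ones((n, 1))                                # intercept only
    X24 = np.column_stack([np.ones(n), log_x])           # intercept and log x
    b23, b24, a24 = [], [], []
    for _ in range(n_rep):
        u = scale * rng.standard_cauchy(n)
        log_y = np.log(beta) + log_x + u                 # data generated by (2.3)
        b23.append(np.exp(huber_irls(X23, log_y - log_x)[0]))   # estimate (2.3)
        c0, c1 = huber_irls(X24, log_y)                          # estimate (2.4)
        b24.append(np.exp(c0))
        a24.append(c1)
    out = {"beta_23": b23, "beta_24": b24, "alpha_24": a24}
    return {k: (float(np.mean(v)), float(np.std(v, ddof=1))) for k, v in out.items()}

results = specification_experiment()
for name, (mean, sd) in results.items():
    print(f"{name}: mean={mean:.3f} sd={sd:.3f}")
```

Under these assumptions the two average estimates of β should be close to .989 while the estimates from (2.4) are noticeably more dispersed, which is the qualitative pattern reported in the text; the exact figures depend on the assumed requests and error scale.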


(ii) Autocorrelated errors 

Table V.4 contains the sample means and standard
deviations for 1000 Huber "M" estimates of β (and α where
applicable) when the artificial data were generated using
(2.3), but with autocorrelated disturbances, and (2.1)
through (2.5) were used for estimation purposes.

An interesting characteristic of Table V.4 is the small
effect the autocorrelated disturbances have on the means
or average estimates, affecting only (2.4) and (2.5)
slightly. However, note that autocorrelation does appear
to increase the dispersion of the estimates. The experiments
which were run using a second scheme (where errors took the
form $u_t = .9u_{t-1} + .5v_t$) further emphasized these
conclusions. When this stronger autocorrelation scheme was
used the mean estimates were still close to the true parameter

Data generated by $y_t = .989\,x_t\,e^{u_t}$ with

            $u_t$ autocorrelated              $u_t$ independent
              β            α                    β            α
(2.1)        .990          NA                  .990          NA
            (.014)                            (.009)
(2.2)        .990          NA                  .990          NA
            (.012)                            (.008)
(2.3)        .989          NA                  .989          NA
         ([illegible])                        (.008)
(2.4)        .985      [illegible]             .993      [illegible]
            (.098)       (.014)               (.068)    ([illegible])
(2.5)        .996      [illegible]             .992         1.000
         ([illegible])   (.015)               (.068)       (.010)

TABLE V.4
Effect of Autocorrelated Errors






value for (2.1) through (2.5). However, the sample
standard deviations of the estimates all increased.*

Another point of interest is that while the normal plots,
when the $u_t$'s were independent, were very nearly straight
lines, this did not hold as true when autocorrelation was
introduced, especially for (2.4) through (2.5).

In summary, although the Monte Carlo studies of the effects
of autocorrelation which were reported here were by no means
exhaustive, they do indicate that the effects of autocorrelation
will be mainly felt in the dispersion of the estimates rather
than in their location or central tendency.
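
A small simulation illustrates why first order autocorrelation would be expected to widen the distribution of the estimates without moving its center. The sketch below is hedged: the coefficient `rho = 0.5`, the innovation scale `k = 0.01`, the fixed MAD scale and the tuning constant are illustrative assumptions, not the schemes used in the study. Under (2.3), log(y_t/x_t) = log β + u_t, so the requests drop out and only the disturbances need to be simulated.

```python
import numpy as np

def huber_location(z, c=1.345, iters=50, tol=1e-10):
    """Huber M-estimate of location with a fixed MAD scale."""
    mu = np.median(z)
    s = max(np.median(np.abs(z - mu)) / 0.6745, 1e-12)
    for _ in range(iters):
        a = np.abs(z - mu) / s
        w = np.where(a <= c, 1.0, c / np.maximum(a, 1e-12))
        new_mu = np.sum(w * z) / np.sum(w)
        if abs(new_mu - mu) < tol:
            return new_mu
        mu = new_mu
    return mu

def ar1_cauchy(n, rho, k, rng, burn=50):
    """u_t = rho*u_{t-1} + k*v_t with standard Cauchy v_t; burn-in is discarded."""
    v = rng.standard_cauchy(n + burn)
    u = np.zeros(n + burn)
    for t in range(1, n + burn):
        u[t] = rho * u[t - 1] + k * v[t]
    return u[burn:]

def dispersion(rho, n_rep=400, n=63, beta=0.989, k=0.01, seed=1):
    """Median and standard deviation of the estimates of beta over n_rep samples."""
    rng = np.random.default_rng(seed)
    est = np.empty(n_rep)
    for i in range(n_rep):
        u = ar1_cauchy(n, rho, k, rng)
        est[i] = np.exp(huber_location(np.log(beta) + u))
    return float(np.median(est)), float(np.std(est, ddof=1))

med_ind, sd_ind = dispersion(rho=0.0)   # independent disturbances
med_ar, sd_ar = dispersion(rho=0.5)     # autocorrelated disturbances
print(f"independent:    median={med_ind:.4f} sd={sd_ind:.4f}")
print(f"autocorrelated: median={med_ar:.4f} sd={sd_ar:.4f}")
```

The center of the estimates should stay near .989 in both runs while the standard deviation is larger in the autocorrelated run, in line with the summary above.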





*The autocorrelation schemes used were actually quite weak.
To examine the question in more detail, data were generated
for which

$u_t = [\text{illegible}]\,u_{t-1} + .1\,v_t$

Once again, results showed that the main impact of this strong
first order autocorrelation was on the dispersion of the
estimates.





V.3.3. Huber "M" Confidence Intervals 


In Chapter IV confidence intervals were constructed by
jackknifing Huber "M" estimates. In order to examine some of
the properties of jackknifed robust estimates, sampling
experiments were undertaken. Data were generated, as in the
previous sections, by the equation $y_t = .989\,x_t\,e^{u_t}$
(β = 0.989), where the $\{u_t\}$ were distributed as
independent Cauchy random variables and the $\{x_t\}$ were the
63 appropriations requests found in the RDTE data. A total of
1000 samples were drawn, and jackknifed estimates for β (and α
where applicable) were derived for (2.1) through (2.5).
Confidence intervals were constructed, using the estimate of
the variance of the jackknifed estimator employed in Chapter
IV. In addition, for these same 1000 samples, confidence
intervals were constructed using Huber's suggested estimate
for the variance of a robust estimator, as discussed in
Appendix D of Chapter IV.

When examining confidence intervals two questions are
most important: coverage and length. For one thousand samples,
the percentage of time the intervals covered or included the
true parameter value was determined. Secondly, the mean and
standard deviation of the size of the confidence intervals
were used as measures of the length of the intervals. Of
course, the ideal interval covers the true parameter value
the fraction of time specified and is also short in length.
The confidence interval chosen was one for which coverage,
assuming normality, is .9544.

Table V.5 contains a comparison of the coverage and
interval length for (a) confidence intervals constructed
using the jackknife, and (b) confidence intervals constructed
using Huber's suggested approach for computing an estimate
of the variance of a robust estimator. Recall that if the
estimator is normally distributed and if the estimate of
its variance is correct, one would expect the confidence
interval to cover the true parameter value approximately
95% of the time. Because of the size of our samples the
normal approximation should be adequate.
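
The jackknifed interval can be sketched as follows. Several choices here are assumptions made only for illustration: a delete-one jackknife (the grouping used in Chapter IV is not restated in this section), a Huber location estimate of log(y/x) under (2.3) with a fixed MAD scale, a Cauchy scale of 0.01, and an interval of plus or minus two standard errors, which is the multiplier giving .9544 coverage under normality. The interval is formed for log β and its endpoints are exponentiated to measure length on the β scale.

```python
import numpy as np

def huber_location(z, c=1.345, iters=50, tol=1e-10):
    """Huber M-estimate of location with a fixed MAD scale."""
    mu = np.median(z)
    s = max(np.median(np.abs(z - mu)) / 0.6745, 1e-12)
    for _ in range(iters):
        a = np.abs(z - mu) / s
        w = np.where(a <= c, 1.0, c / np.maximum(a, 1e-12))
        new_mu = np.sum(w * z) / np.sum(w)
        if abs(new_mu - mu) < tol:
            return new_mu
        mu = new_mu
    return mu

def jackknife_interval(z, estimator, z_crit=2.0):
    """Delete-one jackknife interval: mean of pseudo-values +/- z_crit * s.e."""
    n = len(z)
    full = estimator(z)
    leave_one_out = np.array([estimator(np.delete(z, i)) for i in range(n)])
    pseudo = n * full - (n - 1) * leave_one_out      # pseudo-values
    est = pseudo.mean()
    se = np.sqrt(pseudo.var(ddof=1) / n)
    return est - z_crit * se, est + z_crit * se

def coverage(n_samples=100, n=63, beta=0.989, scale=0.01, seed=2):
    """Fraction of intervals covering the true value, and mean interval length."""
    rng = np.random.default_rng(seed)
    hits, lengths = 0, []
    for _ in range(n_samples):
        z = np.log(beta) + scale * rng.standard_cauchy(n)   # log(y/x) under (2.3)
        lo, hi = jackknife_interval(z, huber_location)
        hits += int(lo <= np.log(beta) <= hi)
        lengths.append(np.exp(hi) - np.exp(lo))
    return hits / n_samples, float(np.mean(lengths))

cov, mean_len = coverage()
print(f"coverage={cov:.2f} mean length={mean_len:.4f}")
```

Only 100 samples are drawn here to keep the run short, so the observed coverage is a rough figure; the study used 1000 samples.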

Several features of Table V.5 are worthy of note.
First, in many cases both confidence intervals, the
jackknifed interval and Huber's suggestion, appear to be
conservative, covering more than 95% of the time. The
coverage of the jackknifed interval is always greater than
that of Huber's suggestion. However, the length of the
jackknifed confidence intervals is slightly greater than
the length of the intervals constructed using Huber's
suggestion. In two cases, (2.1) and (2.5), the coverage of
the jackknifed intervals is considerably greater than that
of Huber's suggested interval. However, in (2.5) the length
of the jackknifed intervals is also considerably larger.

To sum up the results of Table V.5, the following points
should be noted. The coverage of the intervals is very
encouraging, especially when one realizes that the errors
or disturbances were Cauchy distributed. Secondly, although
the coverage of the jackknifed intervals is greater than
that of the Huber intervals, Huber's method produces shorter
intervals. Considering the extra effort required to construct
jackknifed confidence intervals, the expense may not be
justified. The exception to this is the case in which the data
are generated by means of (2.3) and the coefficients estimated
using equation (2.1): here the coverage of Huber's suggested
interval is quite low.

V.4. Congressional Moods or Eras and the Air Force 


V.4.1. Background 


In Chapters II and III, analysis of residuals indicated
that appropriations from fiscal 1969 to the present were
smaller than the estimated equations (2.1) through (2.5) would
indicate if one were to disregard the error or disturbance
components in those equations. It was suggested that possibly
budget outcomes after fiscal 1969 should be examined
separately from budget outcomes or appropriations prior to
that time. Also, recall that for some of the models, budget
items for years prior to 1960 appeared in the Group 2 and
Group 3 residuals a disproportionate number of times. The
tentative conclusion was that pre-1960 items should be
modelled differently from post-1960 items.

Another question which arose in the analysis of Chapters
II and III centered around the Air Force RDTE budgets between
1957 and 1969. During those years, the Air Force received
appropriations which were either larger than the corresponding
requests or nearly equal to them. This contrasted sharply with
the experience of the Army and the Navy. As a result, Air Force
RDTE items between 1957 and 1969 always yielded positive
residuals after the coefficients in the various equations
were estimated.


V.4.2. Breaking Up the Data by Years 


The data were divided in several ways in order to examine
the question of possible changes in the regular or routine
pattern of budget outcomes in 1960 and in 1969. First, the data
from FY60 through FY68 were segregated and analyzed using the
Huber "M" technique. The remaining data were analyzed
as a block. Application of the Huber "M" technique to the
FY53-FY59 and FY69-FY73 data revealed that the Group 2 and
Group 3 residuals were always composed of items from the
Fifties. This led us to believe it may be useful for
estimation purposes to divide the data into three parts or
eras:

FY53-FY59     FY60-FY68     FY69-FY73

Procurement


Since the data contain requests by the three services
by year, the numbers of data points for procurement in the
three categories were 22*, 36 and 20 respectively. Table
V.6 contains the results of the analysis of the three data
sets for procurement. Table V.7 contains nominal 95%
confidence intervals. These intervals were computed using
Huber's method. Although the sample sizes are such that the
normal approximation to the Student "t" is somewhat inaccurate,
the over-coverage of the intervals reported earlier in the
discussion of the Monte Carlo studies somewhat counterbalances
this inaccuracy.

*The straggler 1953 PAMN was omitted from consideration.
1953 Air Force Aircraft, when appropriations totalled
8 billion, was also eliminated from consideration.

The point estimates in Table V.6 are very revealing. For
the 1960-1968 time frame, the estimated β is about 1.01
(depending on the model used) while for 1969-1973 the estimated
β for the fixed percentage models is approximately .91. This
represents a decrease of approximately .10 in the percentage
of requests which have been appropriated in the 1969-1973 time
period. For the fixed percentage models, (2.1) through (2.3),
the results for 1953-1959 lie between the 1960-1968 and
1969-1973 results. Also of interest in Table V.6 is the fact
that for the variable percentage models (2.4) and (2.5) using
1960-1968 data the estimated α is at or near 1.0. This says
that the percentage of the request appropriated is not
sensitive to the size of the request. Recall that for the
analysis of specification error the results using (2.1)
through (2.5) were on the average the same when the data were
generated by (2.3), a fixed percentage model. It appears that
a fixed percentage model is most appropriate for the 1960-1968
period.
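
The link between α and the sensitivity of the appropriated percentage to request size can be written out explicitly. The step below assumes that (2.4) has the multiplicative form used to generate the artificial data in the Monte Carlo studies, $y_t = \beta x_t^{\alpha} e^{u_t}$; if the Chapter II definition differs, the algebra would need to be adjusted.

```latex
\frac{y_t}{x_t} = \beta\, x_t^{\alpha - 1}\, e^{u_t},
\qquad
\frac{\partial \log (y_t / x_t)}{\partial \log x_t} = \alpha - 1 .
```

With α = 1 the exponent on $x_t$ vanishes and the ratio reduces to $\beta e^{u_t}$, the fixed percentage form (2.3). With α below 1 the appropriated percentage falls as the request grows, and with α above 1 it rises.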

The interval estimates in Table V.7 contain three
interesting results. (i) All of the confidence intervals
for (2.1) through (2.5) in the 1953-1959 period are quite
large. For example, the interval for β in (2.3) is
(.005,1.049), the interval for (2.4) is (.317,19.314).
Despite the fact that the Huber "M" technique was forcing
items like 1954 AF A/C (request of $4283 million and
appropriations of $2453 million) and 1959 PEMA (request of
$970 million and appropriations of $1669 million) into the
Group 2 and Group 3 residuals, the estimates for this period
are very unstable. While in general our models are probably
too simple to describe what is really happening, this is
especially true for the 1953-1959 time period in the
procurement area. There are a number of explanations for this.
First, this period includes the end of the Korean War and
the subsequent change in military policy to that of massive
retaliation [Crecine and Fischer, 1971, pp. 21-23] with less
emphasis on conventional forces. A Congress unwilling
to completely adhere to such a policy might respond by adding
dollars in the PEMA account and cutting some of the dollars
from the Air Force accounts. On the other hand, by 1960 and
certainly during the first Kennedy budgets, the administration
policy provided for more dollars for all the Services [Crecine
and Fischer, 1971, pp. 4-5]. This fact, together with the
Congressional unwillingness to cut procurement during the
Vietnam conflict, accounts for the greater stability of the
1960-1968 estimates. The 1953-1959 data may possibly be better
explained by a complex gaming model in which prior year cuts
are taken into account, although there is no special reason
why this should be the case. (ii) A second interesting aspect
of Table V.7 is the length of the intervals for (2.4) and (2.5)
for the 1969-1973 time frame. A comparison of these intervals
with those for (2.1) through (2.3) suggests that a fixed
percentage model may be most appropriate for this period, if
only because of the instability of the estimates for (2.1)
through (2.3). (iii) Finally, note that the intervals for
the fixed percentage models, (2.1) through (2.3), for 1960-1968
do not overlap with the intervals for 1969-1973. Although
this result does not represent the result of a classical
statistical test, it does strongly support the contention that
in the area of procurement the regular part of 1969-1973
budget outcomes follows a pattern different from that
followed by the regular or routine part of 1960-1968 budget
outcomes.

RDTE


Tables V.8 and V.9 contain point estimates and nominal
95% confidence interval estimates for the RDTE data when
broken into the segments 1953-1959, 1960-1968, 1969-1973.
The sample sizes were 21, 27 and 15 respectively. Again
the normal approximation causes some inaccuracy in
that the true coverage is not quite 95%, especially for the
sample of size 15. However, the over-coverage of Huber's
suggested technique for constructing confidence intervals
which was reported in previous sections somewhat
counterbalances this inaccuracy.

The results in Table V.8 show that the point estimates
for the 1969-1973 time frame in the fixed percentage models
are smaller by approximately 5 percentage points than those
for 1960-1968 (and 1953-1959). The positive elasticities
resulting from estimates of α in (2.4) and (2.5) of 1.05 in
the 1969-1973 data are somewhat puzzling. However, the
confidence intervals for α in Table V.9 are unusually wide,
which indicates that the estimates in Table V.8 should
probably not be taken too seriously. As with the procurement
data, the confidence intervals for β in (2.1) through (2.3)
for the 1960-1968 data do not overlap with the intervals for
the 1969-1973 data. Finally the similarity between the
estimates (point and interval) for 1960-1968 data and 1953-1959
data using (2.1) through (2.3), the fixed percentage models,
is almost as interesting as the difference between the
estimates using (2.4) and (2.5), the variable percentage
models. A further refinement of the data, that is separating
out 1957-1969 AF, will clarify the matter.


V.4.3. The Air Force and RDTE 


Analysis of residuals in Chapters II and III has
indicated that Air Force RDTE 1957-1969 should probably be
separated from the rest of the data. In nine of these thirteen
years the Congress appropriated more than was requested.
Tables V.10 and V.11 contain point estimates and confidence
intervals for the following RDTE data groups: Air Force
(1957-1969); Army and Navy (1960-1968); Army, Navy and Air
Force (1953-1959, except 1957-1959 Air Force); Army, Navy
and Air Force (1969-1973, except 1969 Air Force).

There are several interesting results in Tables V.10 and
V.11 which need elaboration. (i) For the first time in the
data analyzed thus far there is a noticeable difference in
the results for (2.1) and those for (2.2) and (2.3). For
the 1957-69 Air Force data, the estimate for β in (2.1) is
[illegible] while that for (2.2) is 1.066. Examination of the
data points indicates that six of the thirteen $x_t$ values are
smaller than 1.6 billion (4 being less than .750 billion)
while the other seven $x_t$ values are greater than 3.0 billion.
This wide range of $x_t$ values leads us to believe that for
this data a model which allows some interdependence between
$x_t$ and the disturbance, like (2.2) or (2.3), may be
preferable to one which doesn't. A similar observation is
relevant when considering differences between the results for
(2.4)-(2.5) and those for (2.2)-(2.3). The variable percentage
models allow some interdependence between request size and the
percentage of the request appropriated. In most of the data
sets analyzed so far in this section, the request size has
not varied enough to lend credence to any estimated
interdependence. This is not true for the 1957-1969 Air Force
data. However, the confidence intervals in Table V.11 for
β and α in (2.4) and (2.5) are rather wide when compared to
the intervals for β in (2.2) and (2.3). Although the Monte
Carlo studies reported earlier indicate that even if the data
were generated by (2.4) the value of the variance of β
when using (2.4) for estimation is higher than the value
of the variance of β when using (2.2) or (2.3) for estimation,
the differences were not as great as those found in Table V.11.
(ii) A second interesting result is associated with the
1969-1973 data set. The elimination of one data point,
1969 Air Force, from the data set has a significant impact
upon the estimates for α and β in (2.4) and (2.5), and a
somewhat smaller impact on the estimates for β in the fixed
percentage models (2.1) through (2.3) [compare Table V.10 with
Table V.8]. Of the fifteen data points between 1969-1973
all represented reductions by the Congress except for 1969
Air Force, where appropriations were $[illegible] billion and
the request was $3.36 billion. The Huber "M" technique made
this item a Group 2 residual, thus reducing its impact on the
estimate. However, as a comparison of Tables V.8 and V.10
indicates, this observation still had a significant effect
on the estimates. This appears to be an instance where the
more severe ψ functions such as those suggested by Andrews
and Hampel or by Tukey [Andrews et al., 1972] might
produce estimates which are preferable to the Huber estimates.
With respect to Table V.10 it should also be noted that for
the 1969-1973 data set (without 1969 Air Force) a fixed
percentage model appears to be most appropriate since the
estimates for α in (2.4) and (2.5) are approximately equal
to 1.0.

(iii) Insofar as confidence intervals are concerned, the
confidence intervals (using (2.1) through (2.3)) for the
1969-1973 data (without 1969 Air Force) do not overlap with
those for the 1960-1968 data (without Air Force) nor with
those from the 1957-1969 Air Force data. This reinforces the
contention that in RDTE, as well as in procurement, the
relationship of appropriations to requests changed in 1969
or 1970. Also, there is only a slight overlap between the
confidence intervals (using (2.2)-(2.3)) for the 1957-1969 Air
Force data and the confidence intervals for the 1960-1968 data
(without Air Force). The same holds true for the 1957-1969
Air Force data and the 1953-1959 data (without 1957-1959
Air Force).

(iv) Finally, the estimates (point and interval estimates) for
the 1953-1959 data (without 1957-1959 Air Force) and the
1960-1968 data (without Air Force) are very similar when using
(2.1) through (2.3). However, this is not true for (2.4) and
(2.5), the variable percentage models. Closer inspection of the
data shows that the values for the requests in each data set
taken alone were all very similar. The requests in the
pre-1960 data were all less than 1 billion while those for
1960-1968 were all greater than 1 billion. It appears that
this small dispersion of the requests in each of these data
sets resulted in unstable and unreliable estimates for α and
β in (2.4) and (2.5). Table V.12 contains the results when
the 1953-1968 data (without 1957-1968 Air Force) were
analyzed as a unit. The small confidence intervals for all
of the coefficients lead us to believe that this data should
be treated as a unit. Of interest, also, is the fact that α
in (2.4) and (2.5) is estimated to be approximately equal to
1.0, which means that even though the data contained rather
widely dispersed $x_t$ values, the percentage of the request
appropriated did not appear to be sensitive to the size of
the request.
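
Item (ii) above suggests that a more severe ψ function might have handled the 1969 Air Force observation better than the Huber ψ. The difference can be seen by comparing the weight ψ(r)/r that each function gives to a scaled residual r. The sketch below uses the standard textbook forms of the Huber, Tukey biweight and Andrews sine functions; the tuning constants are commonly used defaults and are an assumption here, not values taken from this study, and the Andrews function is multiplied by its constant only so that all three behave like r near zero.

```python
import numpy as np

def huber_psi(r, c=1.345):
    """Huber: linear in the middle, constant (never zero) in the tails."""
    return np.clip(r, -c, c)

def tukey_biweight_psi(r, c=4.685):
    """Tukey biweight: redescends smoothly to zero at |r| = c."""
    r = np.asarray(r, dtype=float)
    return np.where(np.abs(r) <= c, r * (1.0 - (r / c) ** 2) ** 2, 0.0)

def andrews_sine_psi(r, a=1.339):
    """Andrews sine: redescends to zero at |r| = a*pi."""
    r = np.asarray(r, dtype=float)
    return np.where(np.abs(r) <= a * np.pi, a * np.sin(r / a), 0.0)

# Weights psi(r)/r for a small, a moderate and a very large scaled residual.
r = np.array([0.5, 2.0, 10.0])
for name, psi in [("Huber", huber_psi),
                  ("Tukey biweight", tukey_biweight_psi),
                  ("Andrews sine", andrews_sine_psi)]:
    print(f"{name:15s}", np.round(psi(r) / r, 3))
```

A very large residual keeps a small positive weight under the Huber function but receives zero weight under either redescending function, which is why a single extreme item can still move a Huber estimate.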

Some tentative explanations for the results in the RDTE
area are probably in order at this time. The three major
results are that for the 1957-1969 Air Force data the estimated
β was approximately 1.06. For the 1953-1968 data (without
1957-1968 Air Force) the estimated β was near 1.0, while for
the 1969-1973 data (without 1969 Air Force) the estimated β
was approximately .94. The first result, the β of 1.06 for
the Air Force in the 1957-1969 period, is probably associated
with the missile gap fears starting in the late 1950's when
the Soviets orbited the first Sputniks and with the dispute
between Secretary McNamara and the Armed Services Committees
over the B-70 bomber in the mid-1960's. The Secretary
believed the bomber was unnecessary while the Committees
disagreed. With respect to the apparent "rubber stamping"
in the 1953-1968 period, except for the 1957-1969 Air Force,
there are two possible explanations. The first is that prior
to fiscal 1963 authorizations were not required for RDTE.
Consequently the Armed Services Committees were not examining
budget requests. Until recently, one of the characteristics
of RDTE budget requests was that it was difficult to pinpoint
and identify individual projects for which funding was being
requested. Finally, prior to the 1960's, RDTE was a small
percentage of the overall Defense budget request, especially
Army and Navy RDTE. Thus, since the Defense Appropriations
Subcommittees in the House and Senate were the only ones
examining the requests prior to 1963, since the requests did
not clearly identify projects to be funded and since the
amount of money involved was not too great, it is reasonable
to conclude that a subcommittee might essentially "rubber
stamp" a request. The 1964-1968 approvals of RDTE requests
are probably associated with the Congressional reluctance
to cut the Defense budget during the Vietnam buildup.

The third major result of the analysis of RDTE data was
the decrease in appropriations as a percentage of requests
in the 1969-1973 time frame. By 1969, the Armed Services
Committees had been reviewing authorization requests for
five years and had developed some expertise in this area.
Also, by 1969 the budget backup data which stated what the
funds were to be used for was organized into a number of
relatively visible and identifiable units [Armed Services
Committee Print]. These factors, together with public
dissatisfaction with military spending in general, appear to
have contributed to the change in 1969 and its persistence
up to the current time.

V.5. Summary and Conclusions 


This chapter brings to a close our present exploratory
analysis of Defense appropriations in procurement and RDTE
as they depend upon requests. Alternative ways of specifying
what has been termed the regular aspects of budget outcomes
were discussed in Chapter II. The remainder of the analysis
has consisted of attempts to probe more deeply into the
questions of how best to specify these regular aspects and how
best to estimate them. The Huber "M" technique was shown to
be, in many instances, quite suitable for estimation purposes
because of its designed insensitivity to unusual or irregular
data points or occurrences. The question of confidence
intervals was explored in Chapter IV, while small sample
properties and specification error were explored in the first
part of Chapter V. The discussion in IV.4 in general favored
models which state that appropriations are a fixed percentage
of requests, (2.1)-(2.3). The models (2.1)-(2.3) differ among
themselves in the way the random disturbance enters their
specification. Tables V.13 and V.14 contain coefficient
estimates for the models that were chosen as most appropriate.
The estimates in Tables V.13 and V.14 in no way represent
the final word on the topic of Congressional appropriations
as a percentage of requests. Hopefully, future research will
probe more deeply into questions such as the following. Why
are the estimated percentages at, or near, the values in
Tables V.13 and V.14? What caused the change in 1969 and
can such changes be predicted? Why was the Air Force so
successful in 1957-1969 in the RDTE area?

This analysis has made no attempt to include formulations
which treat the percentage of the request appropriated in
year t as being related to the percentage appropriated in
year t-1. Formulations of this kind are a natural extension
of this analysis. Another area of future research is related
to individual committee action. Although this analysis has
examined appropriations granted by the Congress as a whole,
a similar analysis could be performed on Armed Services
Committee and Defense Appropriations Subcommittee
recommendations. One very interesting question is concerned
with the Kanter-DDW dispute and probably offers one of the
more promising avenues for future research. Are the
percentages in Tables V.13 and V.14 the result of a conscious
effort on the part of the Congress to make percentage
adjustments in budget requests, or are they the sum total of
non-percentage type adjustments

                      Best Model and Estimate

1953-1959             None
1960-1968*            (2.3)  $y_t = \beta x_t e^{u_t}$            β = 1.010
1969-1973*            (2.3)  $y_t = \beta x_t e^{u_t}$            β = [illegible]

TABLE V.13
Procurement

*(2.2) and, to a certain extent, (2.1) also provide adequate
representations of the data.


                      Best Model and Estimate

1953-1968 (without    (2.3)  $y_t = \beta x_t e^{u_t}$            β = [illegible]
1957-1968 AF)*

1957-1969             (2.4)  $y_t = \beta x_t^{\alpha} e^{u_t}$   β = 1.919, α = .922
Air Force**
              or***   (2.3)  $y_t = \beta x_t e^{u_t}$            β = 1.063

1969-1973 (without    (2.3)  $y_t = \beta x_t e^{u_t}$            β = .935
1969 Air Force)

TABLE V.14
RDTE

*(2.2) and (2.1) also provide adequate representations of
the data. The estimates for β using (2.2) and (2.1) were
equivalent to those of (2.3).

**(2.2) but not (2.1) also provides an adequate representation
of the data.

***The similarity between the results for (2.2) and (2.3) can
be explained by recalling the discussion in Chapter II of
these models. Recall that for (2.3)

$y_t / x_t = \beta + \beta u_t + \beta u_t^2 / 2 + \ldots$, while for (2.2) $y_t / x_t = \beta + \varepsilon_t$,

with $u_t$ and $\varepsilon_t$ being random. The two
representations are very similar if β is near one, and $u_t$
is small.

at a lower level of aggregation (i.e. cuts in specific weapon
systems) by a Congress which has only limited time and
resources to review budget requests?

APPENDIX A


In Section V.3, data were generated by (i) model (2.3),
(ii) model (2.4) and (iii) model (2.3) with autocorrelation.
The disturbances, $u_t$, were Cauchy distributed in each case.
Estimates of the coefficient β (and α where appropriate)
were computed using (2.1)-(2.5). Since 1000 samples of
size 63 were generated, it was possible to examine the
distributions of the estimates for β (and α). Probability
plots were constructed for the 1000 estimates of β (and α)
when using (2.1)-(2.5) under conditions (i), (ii) and (iii).
Also, the hypothesis tests $b_2$ and $\sqrt{b_1}$ [Pearson and
Hartley, 1966, pp. 67-68] were made in order to determine
whether or not it was possible to reject a hypothesis of
normality.

In an extensive Monte Carlo study made by Wilk, Shapiro
and Chen [1968], $\sqrt{b_1}$ performed well in comparison to
other standard goodness-of-fit tests for detecting
non-normality under a wide variety of circumstances. The test
which performed best was the W test [Wilk and Shapiro, 1965].
However, the W test is designed for samples of 50 or less. In
no case was it possible to reject the null hypothesis, using
$b_2$ or $\sqrt{b_1}$. For example, when the data were
generated by means of (2.3) with autocorrelation and model
(2.3) was used for estimation purposes, the value obtained
for $b_2 - 3$, using the estimated coefficients from 1000
samples, was 7.11079. The upper 5% point, as tabulated in
[Pearson and Hartley, 1966, p. 208], is $2.6 \times 10^{-1}$.
This was typical of the results obtained in the tests.
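
The two moment statistics can be computed directly from a set of estimates. The sketch below uses the usual definitions, $\sqrt{b_1} = m_3 / m_2^{3/2}$ and $b_2 = m_4 / m_2^2$ with $m_k$ the k-th central sample moment (divisor n). The sample of 1000 normal draws is artificial and stands in for a set of coefficient estimates; the critical value 0.26 is the upper 5% point quoted in the text, as read here.

```python
import numpy as np

def moment_statistics(values):
    """Return (sqrt_b1, b2): sample skewness and kurtosis from central moments."""
    v = np.asarray(values, dtype=float)
    d = v - v.mean()
    m2, m3, m4 = (np.mean(d ** k) for k in (2, 3, 4))
    return m3 / m2 ** 1.5, m4 / m2 ** 2

# Small symmetric check: m2 = 2, m3 = 0, m4 = 6.8, so b2 = 6.8 / 4 = 1.7.
sqrt_b1, b2 = moment_statistics([-2, -1, 0, 1, 2])

# Artificial stand-in for 1000 coefficient estimates (NOT the study's output).
rng = np.random.default_rng(3)
sample = rng.normal(loc=0.989, scale=0.008, size=1000)
s_sqrt_b1, s_b2 = moment_statistics(sample)

UPPER_5PCT_B2_MINUS_3 = 0.26   # upper 5% point for n = 1000, as quoted in the text
print(f"sqrt(b1)={s_sqrt_b1:.3f}  b2-3={s_b2 - 3:.3f}  "
      f"exceeds 5% point: {s_b2 - 3 > UPPER_5PCT_B2_MINUS_3}")
```

Values of $b_2 - 3$ below the tabulated point give no grounds for rejecting normality on the basis of kurtosis.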

Examples of the probability plots are contained in this
appendix. The plots on pages A-3 and A-4 are typical of
the plots obtained. Almost all of the plots appeared linear,
like that on page A-3, where the data were generated by (2.3)
and estimates computed using (2.3). The plots, like that on
page A-4, were not quite so linear when autocorrelation
(data generated by (2.3) with autocorrelation) was introduced
on top of specification error (estimates computed using (2.4)).